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I. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application, U.S. Patent Application Serial No. 09/941,992 recorded 
November 16, 2001, at Reel 012176 and Frame 0450. 

II. RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to a polypeptide referred to 
herein as "PRO809." There exist two related patent applications, (1) U.S. Patent Application 
Serial No. 09/991,073, filed November 14, 2001 (containing claims directed to PRO809 
polypeptides), and (2) U.S. Patent Application Serial No. 09/990,726, filed November 14, 2001 
(containing claims directed to antibodies to the PRO809 polypeptide). These two related 
applications are also under final rejection from the same Examiner and based upon the same 
outstanding rejection, therefore appeal of these final rejections are being pursued independently 
and concurrently herewith. 

III. STATUS OF CLAIMS 
Claims 122-126, 129-131 and 135-138 are in this application. 
Claims 1-123, 125-128 and 132-134 have been canceled. 

Claims 124, 129-131 and 135-138 stand rejected and Appellants appeal the rejection of 
these claims. 

A copy of the rejected claims in the present Appeal is provided in the Claims Appendix. 

IV. STATUS OF AMENDMENTS 

The claims involved in the appeal have been amended by an amendment filed 
concurrently with this appeal brief to cancel Claims 122-123 and 125-126. The claims listed in 
the Appendix incorporate this amendment. 

V. SUMMARY OF CLAIMED SUBJECT MATTER 

The invention claimed in the present application is related to an isolated nucleic acid 

comprising: an isolated nucleic acid of SEQ ID NO: 222; the full-length coding sequence of the 

nucleic acid sequence of SEQ ID NO: 222; or the full-length coding sequence of the cDNA 
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deposited under ATCC accession number 203025 (independent Claim 124). The PRO809 gene 
was shown for the first time to be significantly amplified in human lung cancers as compared to 
normal, non-cancerous human tissue controls (Ct values of 1.05 -1.61, that is, 2.070 to 3.053 fold 
amplification). This is set forth in the specification, at least in the 'Gene Amplification assay,' 
Example 170, page 539, line 19, to page 555, line 5 (specifically, see Table 9A, page 550). The 
profiles of various primary lung tumors used for screening the PRO polypeptide compounds of 
the invention in the gene amplification assa^ are summarized on Table 8, page 546 of the 
specification. This feature is carried by all claims dependent directly or indirectly from 
Claim 124, namely, Claims 129-131 and 135-138. Methods for selecting a host are generally set 
forth in the specification at, for example, in Examples 140-143 and page 376, line 12 onwards 
(Claims 137-138), and describes the expression of PRO nucleic acids in various host cells, 
including E. coli, yeast and Baculovirus-infected insect cells. Methods for selecting a vector are 
generally set forth in the specification at, for example, on page 378, line 8 (Claims 135 and 136). 

Finally, the presently claimed nucleic acid sequence (referred to in the present application 
as "DNA59841-1460") is shown in the present specification as SEQ ID NO: 222 while the 
polypeptide encoded by this polynucleotide sequence is defined as SEQ ID NO: 223, and is 
further described in Figures 150 and 151, respectively. The isolation of cDNA clones encoding 
PRO809 of SEQ ID NO: 222 is described at least in pages 140-142 and in Example 64, page 454 
of the specification. 

VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

1. Whether Claims 122-126, 129-131 and 135-138 satisfy the utility/ enablement 
requirement under 35 U.S.C. §§101/112, first paragraph. 

2. Whether Claims 122-124 and 135-138 satisfy the written description requirement 
under 35 U.S.C. §112, first paragraph. 

3. Whether Claims 122-126, 129-131 and 135-138 are patentable under 35 U.S.C. 
§ 102(b) over clone H74302 (EST isolated by Hillier et aL, 1995). 
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VII. ARGUMENTS 

Summary of the Arguments 
Issue 1: Utility/ Enablement 

Claims 122-126, 129-131 and 135-138 stand rejected under 35 U.S.C. §§101/112, first 
paragraph, as allegedly lacking either a specific and substantial asserted utility or a well 
established utility. 

The specification discloses that the gene encoding PRO809 showed amplification, 
ranging from 2.070 to 3.053 fold in different lung primary tumors . The Declaration of 
Dr. Audrey Goddard, submitted with Appellants* Response filed November 3, 2004, explains 
that a gene identified as being amplified at least 2-fold by the disclosed gene amplification assay 
in a tumor sample relative to a normal sample is useful as a marker for the diagnosis of cancer , 
and for monitoring cancer development and/or for measuring the efficacy of cancer therapy. 
Therefore, the gene amplification levels of 2.070 to 3.053 fold for lung primary tumors is 
considered significant. Hence, Appellants submit that one skilled in the art would find it more 
likely than not that PRO809 is useful as a diagnostic tool for detecting certain lung tumors. 

Thus, Appellants submit that, as any skilled artisan in the field of oncology would easily 
appreciate, not all tumor markers are generally associated with every tumor, or even with most 
tumors . Therefore, whether the PRO809 gene is amplified in few tumor samples or in the vast 
majority of tumor samples studied is not relevant to its identification as a tumor marker, or its 
patentable utility. Rather, the fact that the amplification data for PRO809 is considered 
significant is what lends support to its usefulness as a tumor marker. Thus, a positive result does 
indicate the presence of cancer, while a negative result requires further follow up testing, testing 
which is considered routine by one skilled in the art of oncology and is not considered undue. 

Appellants submit the art supports at least one utility for the PRO809 gene, that is, as a 
genetic biomarker for cancer or precancerous cells or damaged tissue. Appellants maintain that 
Sen et al and the art support the Appellants' position that aneuploidy may be a feature of either 
cancerous or pre-cancerous tissue or damaged tissue. For example, Hittelman et al shows that 
"epithelial tumors develop through a multistep process driven by genetic instability" in damaged 
lesions. Therefore, even if the observed PRO809 gene amplification were due to chromosomal 
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aneuploidy (which Appellants do not concede), it would still support at least one utility for the 
PRO809 gene according to Hittelman, because it helps in identifying individuals at significantly 
increased cancer risk. Appellants also submit that one skilled in the art would clearly know that 
early detection of lung cancer would provide information in advance about risk assessment, 
prognosis and therapy for lung cancer. Accordingly, the instant polynucleotides find utility as a 
diagnostic for cancer or for individuals at risk for developing lung cancer. 

The Examiner further asserted on page 3 of the Final Office Action mailed 
February 23, 2004 that amplification of the PRO809 polynucleotide does not impart a specific, 
substantial, and credible utility since, "the literature cautions researchers from drawing 
conclusions based on small changes in transcript expression levels between normal and 
cancerous tissue." In support of this assertion, the Examiner cited reference Hu et al 

First of all, the claims are directed to nucleic acids , not polypeptides, therefore, the issue 
of whether there is a correlation between gene amplification and polypeptide expression levels is 
irrelevant . One of skill in the art would understand how to use the claimed nucleic acids to 
detect amplification of the gene encoding PRO809, and how to use the gene amplification results 
to diagnose cancer. Thus, the question of whether or not PRO809 mRNA or polypeptide levels 
are also increased in these cancers has no relevance to the utility of the claimed nucleic acid 
molecules. Moreover, the teachings of Hu et al, do not conclusively establish a prima facie case 
for lack of utility. 

Instead, Appellants submit that based on the gene amplification data and the substantial, 
credible, asserted utility of the PRO809 gene in the diagnosis of lung cancer, one of ordinary 
skill would know exactly how to make and use these claimed nucleic acids for the diagnosis of 
cancers, without any undue experimentation. 

Issue 2: Written Description 

Claims 122-124 and 135-138 stand rejected under 35 U.S.C. §1 12, first paragraph, as 
allegedly lacking adequate written description for the claimed variant polynucleotides. 

Claims 122-123 have been canceled without prejudice or disclaimer. The polynucleotide 
sequence of SEQ ID NO: 222 claimed in Claim 124 has been reduced to practice. 
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Claims 135-138 depend from Claim 124 and therefore carry all the features of Claim 124. 
Methods for selecting a host are generally set forth in the specification, for example, in 
Examples 140-143 and page 376, line 12 onwards (Claims 137-138), which describes the 
expression of PRO nucleic acids in various host cells, including E. coli, yeast and Baculovirus- 
infected insect cells, and methods for selecting a vector are generally set forth in the specification 
at, for example, on page 378, line 8 (Claims 135 and 136). Accordingly, one skilled in the art 
would understand that the presently claimed invention of Claims 124 and 135-138 were reduced 
to practice at the time of filing of this application. 

Issue 3: Anticipation by Hillier et al. 1995 (EST clone H74302) 

Claims 122-126, 129-131 and 135-138 are rejected under 35 U.S.C. § 102(b) as being 
unpatentable over clone H74302 (EST isolated by Hillier et al., 1995). 

Appellants submit that the Hiller EST clone does not disclose the entire nucleotide 
sequence of SEQ ID NO: 222 nor did they disclose that their EST was part of cDNA (see 
enclosed alignment Item 1, Evidence Appendix). Thus, Hillier et al did not teach nor reduce to 
practice the nucleotide sequence of SEQ ID NO: 222. Accordingly, one of skill in the art would 
not have been able to envision the cDNA clone defined in SEQ ID NO: 222 and therefore, Hillier 
et al does not anticipate the instant invention. 

These arguments are all discussed in further detail below under the appropriate headings. 

Response to Rejections 

ISSUE 1. Claims 122-126, 129-131 and 135-138 are Supported by a Credible. Specificand 
Substantial Asserted Utility, and Thus Meet the Utility Requirement of 35 U.S.C. 
§§101/112, First Paragraph 

The sole basis for the Examiner's rejection of Claims 124, 129-131 and 135-138 under 
this section is that the data presented in Example 170 of the present specification is allegedly 
insufficient under the present legal standards to establish a patentable utility under 35 U.S.C. 
§ 1 0 1 for the presently claimed subject matter. 

Claims 124, 129-131 and 135-138 stand further rejected under 35 U.S.C. §112, first 
paragraph, allegedly "since the claimed invention is not supported by either a specific and 
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substantial asserted utility or a well established utility for the reasons set forth above, one skilled in 
the art clearly would not know how to use the claimed invention." 

Appellants strongly disagree and, therefore, respectfully traverse the rejection. 

A. The Legal Standard For Utility Under 35 U.S.C, § 101 

According to 35 U.S.C. § 101: 

Whoever invents or discovers any new and useful process, machine, manufacture, 
or composition of matter, or any new and useful improvement thereof, may obtain 
a patent therefor, subject to the conditions and requirements of this title. 
(Emphasis added). 

In interpreting the utility requirement, in Brenner v. Manson, the Supreme Court held 
that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent applicant disclose a "substantial utility" for his or 

2 

her invention, i.e., a utility "where specific benefit exists in currently available form." The 
Court concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 
commerce rather than the realm of philosophy." 

4 

Later, in Nelson v. Bowler, the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may 
not establish a specific therapeutic use. The Court held that "since it is crucial to provide 
researchers with an incentive to disclose pharmaceutical activities in as many compounds as 
possible, we conclude adequate proof of any such activity constitutes a showing of practical 

utility." 5 

1 Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 

2 Id. at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson y. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 

5 Id. at 856, 206 U.S.P.Q. (BNA) at 883. 
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In Cross v. Iizukaf* the C.A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "in vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of in vivo test results, 

7 

i.e., there is a reasonable correlation there between/ 1 The Court perceived, "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 

8 

practical utility." 

The case law has also clearly established that Appellants 1 statements of utility are usually 

9 

sufficient, unless such statement of utility is unbelievable on its face. The PTO has the initial 
burden to prove that Appellants 1 claims of usefulness are not believable on their face. In 
general, an Applicant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in 

11 12 

the art to question the objective truth of the statement of utility or its scope." * 



6 Cross v. lizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 

7 Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 

^d. 

9 In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 

10 Ibid. 

11 In re hanger, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In re Jolles, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 
F.2d 974, 144 USPQ 351 (1965); In re Sichert, 566 F.2d 1154, 1159, 196 USPQ 209,212-13 
(C.C.P.A. 1977). 
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13 

Compliance with 35 U.S.C. §101 is a question of fact. The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 

14 

totality of the evidence under consideration. Thus, to overcome the presumption of truth that 
an assertion of utility by the Appellant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Appellant. The issue will then be decided on the totality of evidence. 

The well established case law is clearly reflected in the Utility Examination Guidelines 

("Utility Guidelines")/ 5 which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when 
it is particular to the subject matter claimed. For example, it is generally not enough to state that 
a nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the 
public" or similar formulations used in certain court decisions to mean that products or services 
based on the claimed invention must be "currently available" to the public in order to satisfy the 
utility requirement. "Rather, any reasonable use that an Appellant has identified for the 
invention that can be viewed as providing a public benefit should be accepted as sufficient, at 

least with regard to defining a '"substantial" utility.'" 16 Indeed, the Guidelines for Examination 

13 Raytheon v. Roper, 724 R2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) 
cert denied, 469 US 835 (1984). 

14 In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 

1992). 

15 66 Fed. Reg. 1092(2001). 

16 M.P.E.P. §2107.01. 
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of Applications for Compliance With the Utility Requirement, gives the following instruction 
to patent examiners: "If the Appellant has asserted that the claimed invention is useful for any 
particular practical purpose . . . and the assertion would be considered credible by a person of 
ordinary skill in the art, do not impose a rejection based on lack of utility." 

B. Proper Application of the Legal Standard 

Appellants respectfully submit that the data presented in Example 170 starting on 
page 539 of the specification of the specification and the cumulative evidence of record, which 
underlies the current dispute, indeed support a "specific, substantial and credible" asserted utility 
for the presently claimed invention. 

Example 170 describes the results obtained using a very well-known and routinely 
employed polymerase chain reaction (PCR)-based assay, the TaqMan™ PCR assay, also referred 
to herein as the gene amplification assay. This assay allows one to quantitatively measure the 
level of gene amplification in a given sample, say, a tumor extract, or a cell line. It was well 
known in the art at the time the invention was made that gene amplification is an essential 
mechanism for oncogene activation. Appellants isolated genomic DNA from a variety of 
primary cancers and cancer cell lines that are listed in Table 9 (pages 539 onwards of the 
specification), including primary lung and colon cancers of the type and stage indicated in 
Table 8 (page 546). The tumor samples were tested in triplicates with Taqman™ primers and 
with internal controls, beta-actin and GADPH in order to quantitatively compare DNA levels 
between samples (page 548, lines 33-34). As a negative control, DNA was isolated from the 
cells of ten normal healthy individuals, which was pooled and used as a control (page 539, 
lines 27-29) and also, no-template controls (page 548, lines 33-34). The results of TaqMan™ 
PCR are reported in ACt units, as explained in the passage on page 539, lines 37-39. One unit 
corresponds to one PCR cycle or approximately a 2-fold amplification,, relative to control, two 
units correspond to 4-fold, 3 units to 8-fold amplification and so on. Using this PCR-based 
assay, Appellants showed that the gene encoding for PRO809 was amplified, that is, it showed 

17 M.P.E.P. §2107 11(B)(1). 
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approximately 1.05- 1.61 ACt units for lung tumors which corresponds to 2 1 05 -2 K61 - fold 
amplification in lung, or 2.070 to 3.053 fold in different lung primary tumors . Appellants submit 
that, one skilled in the art would find it more likely than not that PRO809 is useful as a 
diagnostic tool for detecting certain lung tumors. 

However, the Examiner states regarding the teachings of the Goddard Declaration that 
"the issue. . ..is not whether the technique is sensitive enough to detect a two-fold difference in 
amount of DNA, but rather that such was detected in only a minority of the tested lines of human 
lung tumor cell lines, which increase is likely to be due to anueploidy in the tumorous tissue, and 
is neither diagnostic of cancer, not evidence of overexpression, which is the actual presence of 
extra protein encoded by the nucleic acid." The Examiner further refers to Sen et al (page 4 of 
the Final Office Action) to show that "cancerous tissues are known to be aneuploid" and that 
"pre-cancerous tissues may be aneuploid." The Examiner further states that the Ashkenazi 
Declaration is not persuasive because in the instant case "it has not been established that the 
changes in the amount of DNA were significant. They occurred in only a minority of 
samples. . .and cannot be considered to be diagnostic in the absence of significance, that is a 
statistically significant correlation between increased copy number and cancer. It remains 
that random aneuploidy is the most parsimonious explanation of the results in the specification" 
(page 4, third paragraph, lines 18-24; emphasis added). Appellants respectfully disagree. 

Appellants respectfully point out that the Declaration by Dr. Audrey Goddard presented 

in their response mailed provides a statement by an expert in the relevant art stating that "fold 

amplification" values of at least 2-fold are considered significant in the TaqMan™ PCR gene 

amplification assay. Appellants particularly draw the Board's attention to page 3 of the Goddard 

Declaration which clearly states that: 

It is further my considered scientific opinion that an at least 2-fold increase in 
gene copy number in a tumor tissue sample relative to a normal (z.e., non-tumor) 
sample is significant and useful in that the detected increase in gene copy number 
in the tumor sample relative to the normal sample serves as a basis for using 
relative gene copy number as quantitated by the TaqMan PCR technique as a 
diagnostic marker for the presence or absence of tumor in a tissue sample of 
unknown pathology. Accordingly, a gene identified as being amplified at least 
2-fold by the quantitative TaqMan PCR assay in a tumor sample relative to a 
normal sample is useful as a marker for the diagnosis of cancer, for monitoring 
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cancer development and/or for measuring the efficacy of cancer therapy. 
(Emphasis added). 

Accordingly, the 2.070 to 3.053 fold in the different lung primary tumors would be considered 
significant and credible by one skilled in the art, based upon the facts disclosed in the Goddard 
Declaration. 

Further, as any skilled artisan in the field of oncology would easily appreciate, not all 
tumor markers are generally associated with every tumor, or even, with most tumors. In fact, 
some tumor markers are useful for identifying rare malignancies . That is, the association of the 
tumor marker with a particular type of tumor lesion may be rare, or, the occurrence of that 
particular kind of tumor lesion itself may be rare. In either event, even these rare tumor markers, 
which may not give a positive hit with most common tumors, have great value in tumor 
diagnosis, and consequently, in tumor prognosis . The skilled artisan would know that such 
tumor markers are very useful for better classification of tumors. Therefore, whether the 
PRO809 gene is amplified in a few lung tumors or in most tumors is not relevant to its 
identification as a tumor marker, or its patentable utility. Rather, whether the amplification data 
for PRO809 is significant is what lends support to its usefulness as a tumor marker. It was well 
known in the art at the time of filing of the application that gene amplification, which occurs in 
most solid tumors like lung and colon cancers, is generally associated with poor prognosis. 
Therefore, the PRO809 gene becomes an important diagnostic marker to identify such malignant 
lung or colon cancers, even if the malignancy associated with PRO809 molecule is a rare 
occurrence . Accordingly, the present specification clearly discloses enough evidence that the 
gene encoding the PRO809 polypeptide is significantly amplified in certain types of lung tumors 
and is therefore, a valuable diagnostic marker for identifying certain types of lung cancers. 

Further, Appellants had submitted the Ashkenazi Declaration to show that "detection of 

gene amplification can be used for cancer diagnosis even if the determination includes 

measurement of chromosomal aneuploidy." Regarding Sen, Appellants agree that aneuploidy 

can be a feature of damaged tissue as well, besides cancerous or pre-cancerous tissue, and may 

not invariably lead to cancer. In fact Appellants maintain that Sen et al support the Appellants' 

position that aneuploidy may be a feature of either cancerous or pre-cancerous tissue or damaged 
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tissue. Further, the art shows that "epithelial tumors develop through a multistep process driven 
by genetic instability" in damaged lesions. Appellants provide a reference by Hittelman et ah 
(copy enclosed) to support this view. Hittelman studied damaged or premalignant lesions and 
suggests that epithelial tumors develop through a multistep process driven by genetic instability 
(see Hittelman abstract). Hittelman showed that a subset of the same molecular changes found in 
associated tumors were also found in premalignant lesions, suggesting that these premalignant 
lesions might represent precursor lesions for associated tumors, i.e., a manifestation of a 
multistep tumorigenesis process. (See Hittelman, page 4, last three lines). Appellants submit 
that there is utility in identifying genetic biomarkers in epithelial tissues at cancer risk (also see 
Hittelman, abstract, line 4-7). Hittelman adds on page 2, fourth paragraph, line 3 that "it is 
important to identify individuals at significantly increased cancer risk who might best benefit 
from different types of intervention." 

Taken together, even if the observed PRO809 gene amplification were due to 
chromosomal aneuploidy (which Appellants do not concede), such an observation would still 
support at least one utility for the PRO809 gene according to Hittelman, because it helps in 
identifying individuals at significantly increased cancer risk. Therefore, the art supports at least 
one utility for the PRO809 gene, that is, as a genetic biomarker for cancer or precancerous cells 
or damaged tissue. As one skilled in the art would clearly know, early detection of lung cancer 
provides information in advance about risk assessment, prognosis and therapy for lung cancer. 
Accordingly, the instant polynucleotides find utility as a diagnostic for cancer or for individuals 
at risk for developing lung cancer. 

Appellants also respectfully remind the Examiner that to overcome the presumption of 
truth that an assertion of utility by the Appellant enjoys, the Examiner must establish that it is 
" more likely than not " that one of ordinary skill in the art would doubt the truth of the statement 
of utility. The remarks in this rejection by the Examiner such as " (instant nucleic acid) cannot 
be considered to be diagnostic in the absence of significance, that is a statistically significant 
correlation between increased copy number and cancer " (emphasis added) are a clear indication 
that the Examiner applies a standard that might be appropriate, if the issue at hand were the 
regulatory approval of a diagnostic assay based on the overexpression of PRO809 in lung tumor, 
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but is fully inappropriate for determining if the "utility" standard of the Patent Statute is met. 
The FDA reviewing an application for a new diagnostic assay will indeed ask for actual 
numerical data, statistical analysis, and other specific information before a diagnostic assay is 
approved. However, the Patent and Trademark Office is not the FDA, and the standards of 
patentability are not the same as the standards for market approval. It is well established law that 
therapeutic utility sufficient under the patent laws is not to be confused with the requirements of 
the FDA with regard to safety and efficacy of drugs to be marketed in the United States. Scott v. 
Finney, 34 F.3d 1058, 1063, 32 USPQ2d 1115, 1120 (Fed. Cir. 1994). Indeed, in Nelson v. 
Bowler, 626 F.2d 853, 856, 206 USPQ 881, 883 (CCPA 1980), the Federal Circuit found that the 
identification of a pharmacological activity of a compound provides an "immediate benefit to the 
public" and satisfies the utility requirement. This logically applies to a diagnostic utility as well. 
The identification of a diagnostic utility for a compound should suffice to establish an 
"immediate benefit to the public" and thus to establish patentable utility for PRO809. 

C. A prima facie case of lack of utility has not been established 

The Examiner further cited Hu et al 9 to show that " the literature cautions researchers 
against drawing conclusions based on small changes in transcript expression levels between 
normal and cancerous tissues" (Page 3 of the Final Office Action mailed February 23, 2005). 

First of all, as discussed above, the increase in DNA copy number for the PRO809 gene 
is significant according to the Goddard Declaration. Moreover, the instant invention is directed 
to polynucleotides, not to mRNA or polypeptides. Hence, the teachings of Hu are not relevant to 
the instant invention. Furthermore, even if it was relevant, contrary to the Examiner's assertion, 
the cited Hu et al reference does not conclusively establish a prima facie case for lack of utility 
for the PRO809 molecule for the reasons outlined below. 

The Hu et al reference is entitled "Analysis of Genomic and Proteomic Data using 
Advanced Literature Mining" (emphasis added). Therefore, as the title itself suggests, the 
conclusions in this reference are based upon statistical analysis of information obtained from 
published literature, and not from experimental data. Hu et al performed statistical analysis to 
provide evidence for a relationship between mRNA expression and biological function of a given 
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molecule (as in disease). The conclusions of Hu et al however, only apply to a specific type of 
breast tumor (estrogen receptor (ER)-positive breast tumor) and cannot be generalized to breast 
cancer genes in general, let alone to cancer genes in general. Interestingly, the observed 
correlation was only found among ER-positive (breast) tumors not ER-negative tumors." (See 
page 412, left column). 

Moreover, the analytical methods utilized by Hu et al have certain statistical drawbacks, 
as the authors themselves admit. For instance, according to Hu et al, "different statistical 
methods" were applied to "estimate the strength of gene-disease relationships and evaluated the 
results." (See page 406, left column, emphasis added). Using these different statistical methods, 
Hu et al "[a]ssessed the relative strengths of gene-disease relationships based on the frequency 
of both co-citation and single citation." (See page 411, left column). As is well known in the 
art, different statistical methods allow different variables to be manipulated to affect the resulting 
outcome. In this regard, the authors disclose that, "Initial attempts to search the literature" using 
the list of genes, gene names, gene symbols, and frequently used synonyms generated by the 
authors "revealed several sources of false positives and false negatives." (See page 406, right 
column). The authors add that the false positives caused by "duplicative and unrelated meanings 
for the term" were "difficult to manage." Therefore, in order to minimize such false positives, 
Hu et al disclose that these terms "had to be eliminated entirely, thereby reducing the false 
positive rate but unavoidably under-representing some genes." Id. (Emphasis added). Hence, 
Hu et al had to manipulate certain aspects of the input data, in order to generate, in their opinion, 
meaningful results. Further, because the frequency of citation for a given molecule and its 
relationship to disease only reflects the current research interest of a molecule, and not the true 
biological function of the molecule, as the authors themselves acknowledge, the "[relationship 
established by frequency of co-citation do not necessarily represent a true biological link." (See 
page 411, right column). Therefore, based on these findings, the authors add, "[t]his may reflect 
a bias in the literature to study the more prevalent type of tumor in the population. Furthermore, 
this emphasizes that caution must be taken when interpreting experiments that may contain 
subpopulations that behave very differently." Id. (Emphasis added). In other words, some 
molecules may have been underrepresented merely because they were less frequently cited or 
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studied in literature compared to other more well-cited or studied genes. Therefore, Hu et ah's 
conclusions are not based on genes/mRNA in general. 

Appellants submit that, based on the nature of the statistical analysis performed herein, 
and in particular, based on Hu's analysis of only one class of genes, namely, the estrogen 
receptor (ER)-positive breast tumor genes, the conclusions drawn by the Examiner, namely that, 
"genes displaying a 5-fold change or less (mRNA expression) in tumors compared to normal 
showed no evidence of a correlation between altered gene expression and a known role in the 
disease (in general)" is not reliably supported. 

In conclusion, when the proper legal standard is used, a prima facie case of lack of utility 
has not been met based on the cited reference Hu et ah by the Examiner. 

Thus, based on the asserted utility for the PRO809 gene in the diagnosis of selected lung 
tumors, the reduction to practice of the instantly claimed nucleic acid sequence of SEQ ID 
NO:222 in the present application, the disclosure of the step-by-step protocols for making and 
isolating cDNA clones encoding PRO809 of SEQ ID NO:222 at least in Example 64, page 454 
of the specification, the disclosure of a step-by-step protocol for expressing PRO809 cDNA in 
appropriate host cells (in Examples 140-143 and page 376, line 12), the step-by-step protocol of 
the gene amplification assay in Example 170, the skilled artisan would know exactly how to 
make and use the claimed nucleic acids for the diagnosis of lung cancers. That is, Appellants 
submit that based on the detailed information presented in the specification and the advanced 
state of the art in oncology, the skilled artisan would not have found any experimentation 
associated with testing lung tumors, given the present disclosure, 'undue.' 

Therefore, since the instantly claimed invention is supported by either a credible, specific 
and substantial asserted utility or a well-established utility, and since the present specification 
clearly teaches one skilled in the art "how to make and use" the claimed invention without undue 
experimentation, Appellants respectfully request reconsideration and reversal of this outstanding 
rejections under 35 U.S.C. §101 and §112, first paragraph, to Claims 124, 129-131 and 135-138. 
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ISSUE 2. Claims 122-124 and 135-138 Satisfy the Written Description Requirement of 35 
U.S.C. SI 12, First Paragraph 

Claims 122-123 have been canceled without prejudice or disclaimer. The polynucleotide 
sequence of SEQ ID NO:222 claimed in Claim 124 has been reduced to practice and hence, this 
rejection directed to Claim 124 is improper. Claims 135-138 depend from Claim 124 and 
therefore carry all the features of Claim 124. Moreover, methods for selecting a host are 
generally set forth in the specification, for example, in Examples 140-143 and page 376, line 12 
onwards (Claims 137-138), which describes the expression of PRO nucleic acids in various host 
cells, including E. coli, yeast and baculovirus-infected insect cells, and methods for selecting a 
vector are generally set forth in the specification at, for example, on page 378, line 8 (Claims 135 
and 136). Accordingly, one skilled in the art would understand that the Appellants were in 
possession of the presently claimed invention of Claims 124 and 135-138 at the time of filing of 
this application. 

Accordingly, Appellants request reconsideration and reversal of this outstanding 
rejections under 35 U.S.C. §112, first paragraph, for lack of written description. 

ISSUE 3. Claims 122-126, 129-131 and 135-138 are not Anticipated Under 35 U.S.C. 
§102(b) by Hiliier et al, 1995 (EST clone H74302) 

Claims 122-126, 129-131 and 135-138 are rejected under 35 U.S.C. § 102(b) as being 
unpatentable over clone H74302 (EST isolated by Hiliier et aL, 1995). 

Appellants submit that the Hiller EST clone does not disclose the entire nucleotide 
sequence of SEQ ID NO: 222 nor did they disclose that their EST was part of a cDNA sequence 
(see enclosed alignment Item 1, Evidence Appendix). Thus, Hiliier et al. did not teach nor 
reduce to practice the nucleotide sequence of SEQ ID NO:222 as disclosed in the present 
application. Further, one of skill in the art would not have been able to envision the cDNA clone 
defined in SEQ ID NO:222 from the teachings of Hiliier and therefore, Hiliier et al does not 
anticipate the instant invention. 

Accordingly, this rejection under 35 U.S.C. §102(b) should be withdrawn. 
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CONCLUSION 



For the reasons given above, Appellants submit that present specification clearly 
describes, details and provides a patentable utility for the claimed invention. Moreover, it is 
respectfully submitted that based upon this disclosed patentable utility, the present specification 
clearly teaches "how to use" the presently claimed nucleic acid. As such, Appellants respectfully 
request reconsideration and reversal of the outstanding rejection of Claims 122-126, 129-131 and 



The Commissioner is authorized to charge any fees which may be required, including 
extension fees, or credit any overpayment to Deposit Account No. 08-1641 (referencing 
Attorney's Docket No. 39780-2730 P1C55 . 



HELLER EHRMAN LLP 

275 Middlefield Road 
Menlo Park, California 94025-3506 
Telephone: (650) 324-7000 
Facsimile: (650) 324-0638 



135-138. 



Respectfully submitted, 



Date: November 22, 2005 
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VIII. CLAIMS APPENDIX 
Claims on Appeal 

124. An isolated nucleic acid comprising: 

(a) the nucleic acid sequence of SEQ ID NO: 222; 

(b) the full-length coding sequence of the nucleic acid sequence of SEQ ED NO: 222; 

or 

(c) the full-length coding sequence of the cDNA deposited under ATCC accession 
number 203025. 

129. The isolated nucleic acid of Claim 124 comprising the nucleic acid sequence of 
SEQ ID NO: 222. 

130. The isolated nucleic acid of Claim 124 comprising the full-length coding 
sequence of the nucleic acid sequence of SEQ ID NO: 222. 

131. The isolated nucleic acid of Claim 124 comprising the full-length coding 
sequence of the cDNA deposited under ATCC accession number 203025. 

135. A vector comprising the nucleic acid of Claim 124. 

136. The vector of Claim 135, wherein said nucleic acid is operably linked to control 
sequences recognized by a host cell transformed with the vector. 

137. A host cell comprising the vector of Claim 135. 

138. The host cell of Claim 137, wherein said cell is a CHO cell, an E. coli or a yeast 

cell. 
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IX. EVIDENCE APPENDIX 

1 . Alignment of the instantly claimed DNA 57836-1338 with EST sequence of 
Hillier e*a/.,.(H74302). 

2. Hittelman et al 2001, Ann. N.Y. Acad. Sci. 952: 1-12. 

3. Declaration of Audrey Goddard, Ph.D. under 35 C.F.R. §1.132, with attached 
Exhibits A-G: 

A. Curriculum Vitae of Audrey D. Goddard, Ph.D. 

B. Higuchi, R. et al., "Simultaneous amplification and detection of specific 
DNA sequences," Biotechnology 10:413-417 (1992). 

C. Livak, K.J., et al., "Oligonucleotides with fluorescent dyes at opposite 
ends provide a quenched probe system useful for detecting PCR product and 
nucleic acid hybridization," PCR Methods Appl 4:357-362 (1995). 

D. Heid, C.A. et al., "Real time quantitative PCR," Genome Res. 6:986-994 
(1996). 

E. Pennica, D. et al, "WISP genes are members of the connective tissue 
growth factor family that are up-regulated in Wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors," Proc. Natl. Acad. Sci. USA 
95:14717-14722(1998). 

F. Pitti, R.M. et al., "Genomic amplification of a decoy receptor for Fas 
ligand in lung and colon cancer," Nature 396:699-703 (1998). 

G. Bieche, I. et al., "Novel approach to quantitative polymerase chain 
reaction using real-time detection: Application to the detection of gene 
amplification in breast cancer," Int. J. Cancer 78:661-666 (1998). 

4. Declaration of Avi Ashkenazi, Ph.D. under 35 C.F.R. §1.132. 

5. Hu et al., "Analysis of genomic and proteomic data using advanced literature 
mining," Journal ofProteome Research 2:405-412 (2003). 

6. Sen S., "Aneuploidy and Cancer", Current Opinion in Oncology, 12: 82-88, 

(2000). 

Item 1 is submitted herewith in response to the Examiner's request for an alignment of the 
claimed nucleic acids with the sequence of the EST clone (see page 6 of Final Office Action 
mailed February 23, 2005). 

Item 2 is submitted herewith to support Appellants' assertion based on the art. 

Items 3-4 were submitted with the Appellants' Response mailed on November 3, 2004. 
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Items 5-6 were made of record by the Examiner in the Office Action mailed June 30, 2005. 
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X. RELATED PROCEEDINGS APPENDIX 



None. 



SV 2147739 vl 

1 1/22/05 1 1 :36 AM (39780.2730) 
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<first sequence: SS.H74302 (length = 582) 
<second sequence: SS.DNA57836 (length = 992) 

<503 matches in an overlap of 562: 89.50 percent similarity 
<gaps in first sequence: 0, gaps in second sequence: 13 (20 bases) 
<score: 1385 (match = 3, mismatch = 0, gap penalty =8+1 per base) 
<endgaps not penalized 

GenBank (Release 149, aug 2005) [Sep 20 13:36:56 2005]: 1 sequence found 

H74302 yu56cll.rl Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 

IMAGE: 230132 5\ mRNA sequence. 582 bp, 
mRNA , linear, EST 31-0CT-1995 

ACCESSION H74302 

VERSION H74302 .1 GI : 1047713 

KEYWORDS EST; 5_prime. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

REFERENCE 1 (bases 1 to 582) 
AUTHORS Hillier,L., Lennon,G., Becker, M., Bonal do , M. F . , Chiapelli ,B. , 
Chissoe,S., Dietrich,N., DuBuque,T. , Favello,A., Gish.w., 
Hawkins, M., Hultman,M., Kucaba,T., Lacy,M., Le , M . , Le , N . , 
Mardis,E., Moore, B., Morris, M., Parsons, J., Prange,C, Rifkin,L. f 
Rohlfinq,T., Schellenberg , K. , soares, M.B., Tan,F., Thierry-Meg, 3 . , 
Trevaskis,E. , Underwood , K. , wohldmann , P. , waterston , R. , wilson,R. 
and Marra,M. 

TITLE Generation and analysis of 280,000 human expressed sequence tags 
JOURNAL Genome Res. 6 (9), 807-828 (1996) 
PUBMED 8889549 
COMMENT Contact: Wilson RK 

Washington university school of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel: 314 286 1800 

Fax: 314 286 1810 

Emai 1 : est@watson . wustl . edu 

Insert Size: 1114 

High quality sequence stops: 308 

Source: IMAGE Consortium, LLNL 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information, 
insert Length: 1114 Std Error: 0.00 
Seq primer: M13RP1 
High quality sequence stop: 308. 
FEATURES Locati on/Qual i f i ers 

source 1..582 

/organism= ,, Homo sapiens" 

/mol_type="mRNA" 

/db_xref= n GDB: 3781228" 

/db_xref="taxon:9606" 

/clone="lMAGE:230132" 

/sex="male" 

/dev_stage="20 week-post conception fetus" 
/labJiost="DHlOB (ampicillin resistant)" 
/clone_lib="Soares fetal liver spleen 1NFLS" 
/note="Organ: Liver and Spleen; vector: pT7T3D (Pharmacia) 
with a modified poly! inker; Site_l: Pac I; Site_2: Eco Rl; 
1st strand cDNA was primed w ith a Pac I - oligo (dT) primer 

[5 ' AACTGGAAGAATTAATTAAAGATC I I I I I I I I I I I I I I I I I I I 3'] , 

double-stranded cDNA was li gated to Eco Rl adaptors 
(Pharmacia), digested with Pac I and cloned into the Pac I 
and Eco Rl sites of the modified pT7T3 vector. Library 
went through one round of normalization. Library 
constructed by Bento Soares and M.Fatima Bonaldo." 

BASE COUNT 



ORIGIN 



SS.H74302 
SS.DNA57836 

SS.H74302 
SS.DNA57836 

SS.H74302 
SS.DNA57836 

SS.H74302 

SS.DNA57836 

SS.H74302 
SS.DNA57836 

SS.H74302 
SS.DNA57836 

SS.H74302 
ss.DNA57836 

SS.H74302 
SS.DNA57836 

SS.H74302 
ss.DNA-57836 

SS.H74302 
SS.DNA57836 

SS.DNA57836 



10 20 30 40 50 

CAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAXGGCCCTACACCCACCGA 

********************************* ***************** 

GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGA-GGCCCTACACCCACCGA 
10 20 30 40 50 

60 70 80 90 100 110 

GGCATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTTCTCCAA 

*************************************************** ******** 

GGCATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGC-TTCTCCAA 
60 70 80 90 100 110 

120 130 140 150 160 170 

GGCACGGGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTT 

************************************************************ 

GGCACGGGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTT 
120 130 140 150 160 170 

180 190 200 210 220 230 

CCCCAAAGGCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCAC 

************************************************************ 

CCCCAAAGGCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCAC 
180 190 200 210 220 230 

240 250 260 270 280 290 

CTATTCCCTCTGTGGAACCAAGAACATCAAGGTTGCCAAGAAGGTGGTGAAGACCCACGA 

********************************* ************************** 

CTATTCCCTCTGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGA 
240 250 260 270 280 290 

300 310 320 330 340 350 

GCCGGCCTCCTTCAACCTCAACGTCACACTCAAGTXCAGTCCAGACCTGGCTCACCTAAT 

*********************************** ************ ********* * 

GCCGGCCTCCTTCAACCTCAACGTCACACTCAAGTCCAGTCCAGACCT-GCTCACCTACT 
300 310 320 ' 330 340 350 

360 370 380 390 400 410 

TTATGGCCGGGCGTCCTCCACCTXAGGTGCCCATGTGGGACAGTTXCCAGGGTTACAGAT 

* ****************** ************ ******* * ** ******* 

T- -CTGCCGGGCGTCCTCCACCTCAGGTGCCCATGT-GGACAGT- -GCCAGGCTACAGAT 
360 370 380 390 400 410 

420 430 440 450 460 470 

GCATTGGGGAGGTTGTGGTTCCAAGCCAGTGTXTGAGGTTGCGGGGXCAATTTXAATTTT 

*** **** * ***** ************* **** * ***** * *** 

GCACTGGG- - AGCTGTGG-TCCAAGCCAGTGTCTGAGCT- - -GCGGGCCAACTTCACTCT 
420 430 440 450 460 

480 490 500 5 10 520 530 

GXAGGACAGAGGGGXAAGGXCCAGGGTTXGGAGXTGATTTTGCCAGGX AGGGXAG 

* ************ * * ******* **** *** * ******* * * *** ** 

GCAGGACAGAGGGGCAGGCCCCAGGGT- - GGAGATGA -TCTGCCAGGCGTCCTCGGGCAG 
470 480 490 500 510 520 

540 550 560 570 580 

TCCAATTATTXATCAAAAAGTTXATTTXGGGAAGXTTTGGXAAGGTTXAAT 

*** ** * * *** * * * * ****** ***** * 

CCCACCTA-TCACCAACAGCCTGA--TCGGGAAGGATGGGCAGGTCCACCTGCAGCAGAG 
530 540 550 560 570 580 

ACCATGCCACAGGCAGCCTGCCAACTTCTCCTTCCTGCCGAGCCAGACATCGGACTGGTT 
590 600 610 620 630 640 



SS . DNA57836 CTGGTGCCAGGCTGCAAACAACGCCAATGTCCAGCACAGCGCCCTCACAGTGGTGCCCCC 

650 660 670 680 690 700 

SS . DNA57836 AGGTGGTGACCAGAAGATGGAGGACTGGCAGGGTCCCCTGGAGAGCCCCATCCTTGCCTT 

710 720 730 740 750 760 

SS . DNA57836 GCCGCTCTACAGGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGGGGGTTCAGGATAGG 

770 780 790 800 810 820 

SS . DNA57836 GAATGGGGAGGTCAGAGGACGCAAAGCAGCAGCCATGTAGAATGAACCGTCCAGAGAGCC 

830 840 850 860 870 880 

SS . DNA57836 AAGCACGGCAGAGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGAGTTCATGCAA 

890 900 910 920 930 940 

SS . DNA57836 AATG AGTGTGTTTT AG CTG CTCTTG C C AC AA AAAA AAAA AAAA AAA AAAAA A 

950 960 970 980 990 



<first sequence: SS.H74303 (length = 414) 
<second sequence: ss.DNA57836 (length = 992) 

<196 matches in an overlap of 414: 47.34 percent similarity 
<gaps in first sequence: 8 (20 bases), gaps in second sequence: 3 (6 bases) 
<score: 474 (match = 3, mismatch = 0, gap penalty =8+1 per base) 
<endgaps not penalized 

GenBank (Release 149, aug 2005) [Sep 20 13:35:36 2005]: 1 sequence found 

H74303 yu56cll.sl Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 

IMAGE: 230132 3', mRNA sequence. 414 bp, 
mRNA , linear, EST 31-OCT-1995 

ACCESSION H74303 

VERSION H74303 .1 GI : 1047714 

KEYWORDS EST; 3_prime. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

REFERENCE 1 (bases 1 to 414) 
AUTHORS Hillier,L., i_ennon,G., Becker,M., Bonal do , M. F . , chiapeln.B., 
Chissoe,S., Dietrich, N., DuBuque ,T. , Favello.A., Gish,w., 
Hawkins,M., Hultman.M., Kucaba.T., Lacy,M., Le,M., Le,N. , 
Mardis,E., Moore, B., Morris,M., Parsons, D., Prange,C, Rifkin,L., 
Rohlfing.T., Schellenberg,K. , Soares, M.B., Tan,F., Thierry-Meg, 3 . , 
Trevaskis,E. , Underwood ,K. , wohl dmann , P. , waterston, R. , Wilson, R. 
and Marra,M. 

TITLE Generation and analysis of 280,000 human expressed sequence tags 
JOURNAL Genome Res. 6 (9), 807-828 (1996) 
PUBMED 8889549 
COMMENT contact: Wilson RK 

Washington university school of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel: 314 286 1800 

Fax: 314 286 1810 

Emai 1 : est@watson . wustl . edu 

insert Size: 1114 

High quality sequence stops: 313 

Source: IMAGE Consortium, LLNL 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information, 
insert Length: 1114 Std Error: 0.00 
Seq primer: Promega -21ml3 
High quality sequence stop: 313. 
FEATURES Locati on/Qual i f i ers 

source 1. .414 

/orqanism="Homo sapiens" 

/mol_type="mRNA" 

/db_„xref="GDB: 3781228" 

/db_xref="taxon:9606" 

/clone="lMAGE: 230132" 

/sex="male" 

/dev_stage="20 week-post conception fetus" 
/lab_host= M DHl0B (ampicillin resistant)" 
/clone_lib="Soares fetal liver spleen 1NFLS" 
/note="Organ: Liver and Spleen; vector: pT7T3D (Pharmacia) 
with a modified polyl inker; Site_l: Pac I; Site„2: Eco Rl; 
1st strand cDNA was primed w ith a Pac I - oligo (dT) primer 
[ 5 1 AACTGGAAGAATTAATTAAAGATC I I I I I I I I I I I I I I I I I I I 3 ' ] , 

double-stranded cdna was li gated to Eco Rl adaptors 
(Pharmacia), digested with Pac I and cloned into the Pac I 
and Eco Rl sites of the modified pT7T3 vector. Library 
went through one round of normalization. Library 
constructed by Bento Soares and M.Fatima Bonaldo." 

BASE COUNT 



ORIGIN 
SS.DNA57836 



SS.DNA57836 



GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAG 
10 20 30 40 50 60 

GCATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGG 
70 80 90 100 110 120 



SS.H74303 
SS.DNA57836 

SS.H74303 
SS.DNA57836 

SS.H74303 
SS.DNA57836 

5S.H74303 
SS.DNA57836 

SS.H74303 
SS.DNA57836 

SS.H74303 
SS.DNA57836 

SS.H74303 
S5.DNA57836 

SS.H74303 
SS.DNA57836 

SS.H74303 
SS.DNA57836 

SS.DNA57836 



GTG 



CACGGGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCC 
130 140 150 160 170 180 

10 20 30 40 50 60 

GCAAGAGCAGCTAAAACACACTCATTTTGCATGAACTCCAAATACGAACAGTGCACGCTG 

*** ***** ** * ******* **** ** 

CCAAAGGCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCT 
190 200 210 220 230 240 



ATGGC 

** * 



70 80 90 100 110 120 

CTGCAGTCCTCTGCCGTGCTTGGCTCTCTGGACGGTTCATTCTACATGGCTGC 

*** * * * * ** * ** *** ** ** 

ATTCCCTCTGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGC 
250 260 270 280 290 300 

130 140 150 160 170 

TGCTTTGCGTC-CTCTGACCTCCCCATTC CCTATCCTGAACCCCCCAAACTCCTCT 

* * * ** ********* ** *** ****** * *** 

CGGCCTCCTTCAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCTACTTCT 
310 320 330 340 350 360 

180 190 200 210 220 

TC ACTCAGACGGCGGGTGCTCCTGTAGAGCG - GCAAGGCAAGGATGGGGCT- - C 

* * ** * ***** * *** ** * ** **** * * ** 

GCCGGGCGTCCTCCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGGG 
370 380 390 400 410 420 

230 240 250 260 270 280 

TCCAGGGGACCCTGCCAGTCCTCCATCTTCTGGTCACCACCTGGGGGCACCACTGTGAG- 

****** ****** ********* * *** ***** 

AGCTGTGGTCCAAGCCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGGG 
430 440 450 460 470 480 

290 300 310 320 330 340 

- - GGCGCTGTGCTGGACATTGGCGTTGTTTGCAGCCTGGCACCAGAACCAGTCCGATGTC 

*** * * **** * ** * ** *** * * * * * 

CAGGCCCCAGGGTGGAGA-TGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAAC 
490 500 510 520 530 

350 360 370 380 390 400 

TGGCTCGGCAGGAAGGAGAAGTTGGCAGG - CTGCCTGTGGATXGGTCTCTGCTGCAGGTG 

* ** * ******* * ****** * **** * * * *** **** 

AGCCTGATCGGGAAGGA TGGGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGCA 

540 550 560 570 580 590 

410 

GACCTGCCCAT 

* ****** * 

G - CCTGCCAACTTCTCCTTCCTGCCG AGCCAGACATCGGACTGGTTCTGGTGCCAGGCTG 
600 610 620 630 640 650 

CAAACAACGCCAATGTCCAGCACAGCGCCCTCACAGTGGTGCCCCCAGGTGGTGACCAGA 
660 670 680 690 700 710 



SS.DNA57836 AGATGGAGGACTGGCAGGGTCCCCTGGAGAGCCCCATCCTTGCCTTGCCGCTCTACAGGA 
720 730 740 750 760 770 

SS . DNA57836 GCACCCGCCGTCTGAGTGAAGAGGAGTTTGGGGGGTTCAGGATAGGGAATGGGGAGGTCA 
780 790 800 810 820 830 

SS . DNA57836 GAGGACGCAAAGCAGCAGCCATGTAGAATGAACCGTCCAGAGAGCCAAGCACGGCAGAGG 
840 850 860 870 880 890 

SS.DNA57836 ACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGG AGTTCATGCAAAATG AGTGTGTTTT 
900 910 920 930 940 950 

SS.DNA57836 AGCTGCTCTTGCCACAAAAAAAAAAAAAAAAAAAAAAA 
960 970 980 990 



<first sequence: SS.H73373 (length = 480) 
<second sequence: SS.DNA57836 (length = 992) 

<456 matches in an overlap of 471: 96.82 percent similarity 
<gaps in first sequence: 0, gaps in second sequence: 7 (9 bases) 
<score: 1303 (match = 3, mismatch = 0, gap penalty =8+1 per base) 
<endgaps not penalized 

GenBank (Release 149, aug 2005)[Sep 20 13:35:00 2005]: 1 sequence found 

H73373 yu48fl0.rl Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 

IMAGE: 229387 5', mRNA sequence. 480 bp, 
mRNA, linear, EST 31-OCT-1995 

ACCESSION H73373 

VERSION H73373 .1 GI : 1047623 

KEYWORDS EST; 5_prime. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

REFERENCE 1 (bases 1 to 480) 
AUTHORS Hillier,L., i_ennon,G. t Becker ,M., Bonaldo,M.F. , Chiapelli,B., 
Chissoe.S., Dietrich, N., DuBuque ,T. , Favello,A., Gish,w., 
Hawkins,M., Hultman,M. f Kucaba,T. , Lacy,M., Le,M. , Le , N . , 
Mardis,E., Moore, B., Morris,M., Parsons, J., Prange,C, Rifkin.L., 
Rohlfinq,T., Schellenberg,K. , Soares, M.B., Tan,F., Thierry-Meg, 3 . , 
Trevaskis,E. , Underwood, K. , wohldmann , P. , waterston,R. , Wilson, R. 
and Marra,M. 

title Generation and analysis of 280,000 human expressed sequence tags 
JOURNAL Genome Res. 6 (9), 807-828 (1996) 
PUBMED 8889549 
COMMENT Contact: Wilson RK 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel : 314 286 1800 

Fax: 314 286 1810 

Emai 1 : est@watson . wust 1 . edu , 

insert Size: 1102 

High quality sequence stops: 306 

Source: IMAGE Consortium, LLNL 

This clone is available royalty-free through LLNL ; contact the 
image consortium (info@image.llnl.gov) for further information, 
insert Length: 1102 Std Error: 0.00 
Seq primer: M13RP1 
High quality sequence stop: 306. 
features Locati on/Qual i f i ers 

source 1..480 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="GDB: 3780483" 

/db_xref="taxon:9606" 

/clone="lMAGE:229387" 

/sex="male" 

/dev_stage="20 week-post conception fetus" 
/lab_host="DHlOB (ampicillin resistant)" 
/clone_lib="Soares fetal liver spleen 1NFLS" 
/note="Organ: Liver and Spleen; Vector : pT7T3D (Pharmacia) 
with a modified polyl inker; Site_l: Pac I; Site_2: Eco Rl; 
1st strand cdna was primed with a Pac I - oligo (dT) primer 
[5' aactggaagaattaattaaagatc I I I I I I I I I I I I I I ! I I I I 3'] , 
double-stranded cdna was li gated to Eco Rl adaptors 
(Pharmacia), digested with Pac I and cloned into the Pac I 
and Eco Rl sites of the modified pT7T3 vector. Library 
went through one round of normalization. Library 
constructed by Bento Soares and M.Fatima Bona! do." 

BASE COUNT 



ORIGIN 

SS.H73373 

SS.DNA57836 

SS.H73373 
SS.DNA57836 

SS.H73373 
SS.DNA57836 

SS.H73373 
SS.DNA57836 

SS.H73373 
SS.DNA57836 

SS.H73373 
SS.DNA57836 

SS.H73373 
SS.DNA57836 

SS.H73373 
SS.DNA57836 

SS.H73373 
SS.DNA57836 

SS.DNA57836 

SS.DNA57836 

SS.DNA57836 



10 20 30 40 50 

CAGGAAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAG 

* * ********************************************* 

GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAG 
10 20 30 40 50 60 

60 70 80 90 100 110 

GCATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGG 

********************************************************* 

GCATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGG 
70 80 90 100 110 120 

120 130 140 150 160 170 

CACGGGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCC 

************************************************************ 

CACGGGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCC 
130 140 ' 150 160 170 180 

180 190 200 210 220 230 

CCAAAGGCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCT 

************************************************************ 

CCAAAGGCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCT 
190 200 210 220 230 240 

240 250 260 270 280 290 

ATTCCCTCTGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGC 

************************************************************ 

ATTCCCTCTGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGC 
250 260 270 280 290 300 

300 310 320 330 340 350 

CGGCCTCCTTCAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTTCACCTXATTT 

************************************************ ****** ** 

CGGCCTCCTTCAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGC-TCACCTACTT- 
310 320 330 340 350 

360 370 380 390 400 410 

CTTGCCGGGCGTTCCTCCACCTCAGGTGCCCATGTGGGACAGTGCCAGGGTTACAGATGG 

********* *********************** *********** ** ******** 

- CTGCCGGGCG - TCCTCCACCTC AGGTG CCCATGT - GG ACAGTGCCA - GGCTACAG ATG C 
360 370 380 390 400 410 

420 430 440 450 460 470 

CATTGGGGAGCTGTGGTCCAAGCCAGTGTTTTGAGGTGCGGGCCAATTTCATTTTGCAGG 

* *********************** * **** ********** **** * ****** 

ACT--GGGAGCTGTGGTCCAAGCCAGTG-TCTGAGCTGCGGGCCAACTTCACTCTGCAGG 
420 430 440 450 460 470 

480 

ACAGAGGG 

******** 

ACAGAGGGGCAGGCCCCAGGGTGGAGATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTA 
480 490 500 510 520 530 

TCACCAACAGCCTGATCGGGAAGGATGGGCAGGTCCACCTGCAGCAGAGACCATGCCACA 
540 550 560 570 580 590 

GGCAGCCTGCCAACTTCTCCTTCCTGCCGAGCCAGACATCGGACTGGTTCTGGTGCCAGG 
600 610 620 630 640 650 

CTGCAAACAACGCCAATGTCCAGCACAGCGCCCTCACAGTGGTGCCCCCAGGTGGTGACC 
660 670 680 690 700 710 



DNA5 7836 AGAAGATGGAGGACTGGCAGGGTCCCCTGGAGAGCCCCATCCTTGCCTTGCCGCTCTACA 
720 730 740 750 760 770 

DNA5 7836 GGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGGGGGTTCAGGATAGGGAATGGGGAGG 
780 790 800 810 820 830 

DNA5 78 3 6 TCAGAGGACGCAAAGCAGCAGCCATGTAGAATGAACCGTCCAGAGAGCCAAGCACGGCAG 
840 850 860 870 880 890 

DNA5 7836 AGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGAGTTCATGCAAAATGAGTGTGT 
900 910 920 930 940 950 

DNA57836 TTTAGCTGCTCTTGCCACAAAAAAAAAAAAAAAAAAAAAAA 
960 970 980 990 



<first sequence: ss.H58326 (length = 418) 
<second sequence: ss.DNA57836 (length = 992) 



<400 matches in an overlap of 415: 96.39 percent similarity 
<gaps in first sequence: 1 (1 base), gaps in second sequence: 4 (4 bases) 
<score: 1155 (match = 3, mismatch = 0, gap penalty =8+1 per base) 
<endgaps not penalized 

GenBank (Release 149, aug 2005)[Sep 20 13:32:19 2005]: 1 sequence found 



H58326 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 
REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
COMMENT 
FEATURES 

source 



BASE COUNT 
ORIGIN 



yr25c07.rl soares fetal liver spleen 1NFLS Homo sapiens cdna clone 
IMAGE: 206316 5', mRNA sequence. 418 bp, 
mRNA , linear, EST 05-OCT-1995 
H58326 

H58326.1 Gl:1011158 

EST; WASHU_est; 5_prime; merck_est. 

Homo sapiens (human) 

Homo sapiens 

1 (bases 1 to 418) 

Hillier,L., wilson,R., etal 

The washu-Merck EST Project 

Unpublished (1995) 

High quality stops: 251; stops: 251; insert: 912. 
Locati on/Qua! i f i ers 
1..418 

/orqanism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="GDB: 3775447" 
/db_xref="taxon:9606" 
/clone=" IMAGE: 206316" 
/sex="male" 

/dev_stage="20 week-post conception fetus" 
/lab_host="DHl0B (ampicillin resistant)" 
/clone_lib="Soares fetal liver spleen 1NFLS" 
/note="Organ: Liver and Spleen; Vector: pT7T3D (Pharmacia) 
with a modified polyl inker; site_l: Pac I; Site_2: Eco Rl; 
1st strand cDNA was primed with a Pac I - oligo (dT) primer 

[5' AACTGGAAGAATTAATTAAAGATC I I I I I I I I I I I I I I I I I I I 3'], 

double-stranded cDNA was li gated to Eco RI adaptors 
(Pharmacia), digested with Pac I and cloned into the Pac I 
and Eco Rl sites of the modified pT7T3 vector. Library 
went through one round of normalization. Library 
constructed by Bento soares and M.Fatima Bona! do." 



ss.DNA57836 
SS.DNA57836 
SS.DNA57836 
SS.DNA57836 

SS.H58326 
SS.DNA57836 



GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAG 
10 20 30 40 50 60 

GCATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGG 
70 80 90 100 110 120 



CACGGGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCC 
130 140 150 160 170 180 

CCAAAGGCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCT 
190 200 210 220 230 240 

10 20 
GAAGGTGGGTGAAGACCCACGAG 

****** **************** 

ATTCCCTCTGTGG AACCAAG AAC ATC AAGGTGGCC AAG AAGGT - GGTG AAG ACCCACG AG 
250 260 270 280 290 



30 40 50 60 70 80 

SS.H58326 CCGGCCTCCTTCAACCTCAACGTCACACTCAAGTCCAGTCCAG ACCTGCTCACCTACTTC 

************************************************ 

SS . DNA57836 CCGGCCTCCTTCAACCTCAACGTCACACTCAAGTCCAGTCCAG ACCTGCTCACCTACTTC 
300 310 320 330 340 350 

90 100 110 120 130 140 

SS.H58326 TGCCGGGCGTCCTCCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGG 

************************************************************ 

S S . DNA5 78 3 6 TGCCGGGCGTCCTCCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGG 
360 370 380 390 400 410 

150 160 170 180 190 200 

SS.H58326 GAGCTGTGGTCCAAGCCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGG 

************************************************************ 

SS . DNA57836 GAGCTGTGGTCCAAGCCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGG 
420 430 440 450 460 470 

210 220 230 240 250 260 

SS.H58326 GCAGG-CCCAGGGTGGAGATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAAC 

***** ****************************************************** 

SS . DNA57836 GCAGGCCCCAGGGTGGAGATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAAC 
480 490 500 510 520 530 

270 280 290 300 310 320 

SS.H58326 AGCCTGATCGGGAAGGATGGGCAGGTCCACCTXCAGCAG AGACCATGCCACAGGAGGCCT 

******************************** ********************* **** 

SS . DNA57836 AGCCTGATCGGGAAGGATGGGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGCAGCCT 
540 550 560 570 580 590 

330 340 350 360 370 380 

SS.H58326 GCCAACTTGTCCTTCCTTGCCGAGCCAGACATCGGACTGG I I I I I GGTGCCAGGGTGCAA 

******** ******* *********************** ** ********** ***** 

SS . DNA57836 GCCAACTTCTCCTTCC-TGCCGAGCCAGACATCGGACTGG-TTCTGGTGCCAGGCTGCAA 
600 610 620 630 640 650 

390 400 410 

SS.H58326 ACAACGTCAATGTCCAGXACAGGGXCCTTTAGAXTT 

****** ********** **** * ** * * * * 

SS . DNA57836 ACAACGCCAATGTCCAGCACAGCG - CCCTCACAGTGGTGCCCCCAGGTGGTGACCAGAAG 
660 670 680 690 700 710 

SS . DNA57836 ATGGAGGACTGGCAGGGTCCCCTGGAGAGCCCCATCCTTGCCTTGCCGCTCTACAGGAGC 
720 730 740 750 760 770 

SS . DNA57836 ACCCGCCGTCTGAGTGAAGAGGAGTTTGGGGGGTTCAGGATAGGGAATGGGGAGGTCAGA 
780 790 800 810 820 830 

SS . DNA57836 GGACGCAAAGCAGCAGCCATGTAGAATGAACCGTCCAGAGAGCCAAGCACGGCAGAGGAC 
840 850 860 870 880 890 

SS . DNA57836 TGCAGGCCATCAGCGTGCACTGTTCGTATTTGGAGTTCATGCAAAATGAGTGTGTTTTAG 
900 910 920 930 940 950 



SS.DNA 57836 CTG CTCTTG C C AC AAAAAA AAAA AAA AAAAA AAAA A 
960 970 980 990 



<first sequence: SS.R02548 (length = 379) 
<second sequence: ss.DNA57836 (length = 992) 

<205 matches in an overlap of 379: 54.09 percent similarity 
<gaps in first sequence: 10 (107 bases), gaps in second sequence: 5 (14 bases) 
<score: 443 (match = 3, mismatch = 0, gap penalty =8+1 per base) 
<endgaps not penalized 



GenBank (Release 149, aug 2005)[Sep 20 13:31:10 2005]: 1 sequence found 



R02548 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 
REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
COMMENT 
FEATURES 

source 



BASE COUNT 

ORIGIN 

SS.DNA57836 



ye80a07.sl Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 
IMAGE: 124020 3 1 , mRNA sequence. 379 bp, 
mRNA, linear, EST 31-MAR-1995 
R02548 

R02548.1 Gl:752284 

EST; WASHU_est; 3_prime; merck_est. 

Homo sapiens (human) 

Homo sapiens 

1 (bases 1 to 379) 

Hillier,L., Wilson,R., etal 

The washu-Merck EST Project 

unpublished (1995) 

High quality stops: 309; stops: 309; insert: 846. 
Locati on/Qual i f i ers 
1. .379 

/orqanism= ,, Homo sapiens 1 ' 

/mol_type="mRNA" 

/db_xref="GDB: 476565" 

/db_xref= n taxon:9606 n 

/clone="IMAGE:124020" 

/sex="male" 

/dev_stage="20 week-post conception fetus" 
/lab_host="DHlOB (ampicillin resistant)" 
/clone_lib="Soares fetal liver spleen 1NFLS" 
/note="Organ: Liver and Spleen; vector: pT7T3D (Pharmacia) 
with a modified polyl inker; Site_l: Pac I; Site_2: Eco RI; 
1st strand cDNA was primed w ith a Pac I - oligo (dT) primer 
[5 ' AACTGGAAGAATTAATTAAAGATC I I I I I I I I 1 I I I I I I I I I I 3'] , 

double-stranded cDNA was li gated to Eco RI adaptors 
(Pharmacia), digested with Pac I and cloned into the Pac I 
and Eco RI sites of the modified pT7T3 vector. Library 
went through one round of normalization. Library 
constructed by Bento Soares and M.Fatima Bona! do." 

GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAG 
. 10 20 30 40 50 60 



SS.DNA57836 
SS.DNA57836 
SS.DNA57836 

SS.R02548 
SS.DNA57836 



GCATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGG 
70 80 90 100 110 120 



CACGGGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCC 
130 140 150 160 170 180 

CCAAAGGCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCT 
190 200 210 220 230 240 

10 

I I I I I GTGGCAAGAGC 

* * ** 

ATTCCCTCTGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGC 
250 260 270 280 290 300 



SS.R02548 
SS.DNA57836 

SS.R02548 
SS.DNA57836 

SS.R02548 
SS.DNA57836 

ss.R02548 
SS.DNA57836 

SS.R02548 
SS.DNA57836 

SS.R02548 
SS.DNA57836 

SS.R02548 
SS.DNA57836 

SS.R02548 
SS.DNA57836 

SS.DNA57836 

SS.DNA57836 

SS.DNA57836 

SS.DNA57836 



. 20 30 40 50 60 70 

AG - - CTAAAACACACTCA - -TTTTGCATGAACTCCAAATACGAACAGTGCACGCTGAT-G 

* ** ** **** * * ** **** -kit it * *** * 

CGGCCTCCTTCAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCTACTTCT 
310 320 330 340 350 360 

80 90 100 110 120 130 

GCCTGCAGTCCTCTGCCGTGCTTGGCTCTCTGGACAGT-TCATTCTACATGGCTGCTGCT 

*** * ****** ** *** ********* ** ***** **** 

GCCGGGCGTCCTCCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACA GATGCA 

370 380 390 400 410 

140 150 160 170 180 

TTGCGTCCTCT-GACCTCCCCATTCCCTATCCT--GAACCCCCCAAACTCCTCTTCACTC 

** * ****** *** * ** ****** **** * ** 

CTGGGAGCTGTGGTCCAAGCCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAG 
420 430 440 450 460 470 

190 200 210 220 

AGACGGGGGGTGCTCCTGTAGAGCGGCAAGGCAAGG 

** *** ** * ** *** * ** *** 

AG-GGGCAGGCCCCAGGGTGGAGATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCA 
480 490 500 510 520 530 



CCAACAGCCTGATCGGGAAGGATGGGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGC 
540 550 560 570 580 590 

230 240 250 260 270 

____ ATGGGGCTCTCCAGGGGACCCTGCCAGTCCTCCATCTTCTGGTCACCACCT 

* * *** * ** ** * ** ** * ******* * ** 

AGCCTGCCAACTTCTCCTTCCTGCCGAGCCAGACA-TCGGACTGGTTCTGGTGCCAGGCT 
600 610 620 630 640 650 

280 290 300 310 320 330 

GGGCCCA- - -CGTGGGTCCAGCCCAGCATCCTGGGAGGTGATGGCCGCAGTGGGAGGCAA 

* ** * ******* **** *** * *** ** ** *** ** * * * 

GCAAACAACGCCAATGTCCAGCACAGCGCCCTCACA-GTGGTGCCCCCAGGTGGTGACCA 
660 670 680 690 700 710 

340 350 360 370 

GGXTGCCAACCAGCACGA-TGGTGGGGTGCCCTGGGGAAACCC-TCCTGG 

* ** * * ** *** **** ****** ** *** **** * 

G AAGATGGAGGACTGGCAGGGTCCCCTGGAGAGCCCCATCCTTGCCTTGCCGCT 

720 730 740 750 760 

CTACAGGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGGGGGTTCAGGATAGGGAATGG 
770 780 790 800 810 820 

GGAGGTCAGAGGACGCAAAGCAGCAGCCATGTAGAATGAACCGTCCAGAGAGCCAAGCAC 
830 840 850 860 870 880 

GGCAGAGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGAGTTCATGCAAAATGAG 
890 900 910 920 930 940 

TGTGTTTTAG CTG CTCTTG CCAC AAAAAAAA AAAA AAAAAAAAAAA 
950 960 970 980 990 
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Genetic Instability in Epithelial 
Tissues at Risk for Cancer 

WALTER N. HITTBLMAN 

Department of Experimental Therapeutics, The University of Texas 
Af. D. Anderson Cancer Center, Houston, Texas 77030, USA 



Abstract Epithelial tumors develop through a multistep process driven by 
genomic instability frequently associated with etiologic agents such as pro- 
longed tobacco smoke exposure or human papilloma virus (HPV) infection. 
The purpose of the studies reported here was to examine the nature of genomic 
instability In epithelial tissues at cancer risk in order to identify tissue genetic 
biomarkers that might be used to assess an individual's cancer risk and 
response to chemopreventive Intervention. As part of several chemopreventlon 
trials, biopsies were obtained from risk tissues (L&, bronchial biopsies from 
chronic smokers, oral or laryngeal biopsies from individuals with premattg* 
nancy) and examined for chromosome instability using in situ hybridization. 
Nearly all biopsy specimens show evidence for chromo some instability 
throughout tne exposed tissue, increased chromosome instability was ouserved 
With histologic progression in the normal to tumor transition of head and neck 
squamous cell carcinomas. Chromosome instability was also seen in premallg- 
nant head and neck lesions, and high levels were associated with subsequent 
tumor development In bronchial biopsies of current smokers, the level of 
ongoing chromosome Instability correlated with smoking intensity (eg., 
packs/day), whereas the chromosome index (average number of chromosome 
copies per cell) correlated with cumulative tobacco exposure (he., pack-years), 
Spatial chromosome analyses of the epithelium demonstrated multifocal clonal 
outgrowths. In former smokers, random chromosome instability was reduced; 
however, clonal populations appeared to persist for many years, perhaps 
accounting for continued lung cancer risk following smoking cessation. 

Keywords; chromosome instability; epithelial cells; aerodigestive tract; 
chemopreyention; cancer risk 



THE NEED FOR BIOMARKERS OF CANCER RISK AND 
RESPONSE TO INTERVENTION 

Epithelial cancers remain a major health challenge in the world. Despite improve- 
ments in staging and the application and integration of surgery, radiotherapy, and 
chemotherapy, the 5-year survival rate for individuals with lung cancer is only about 
15&. 1 Even if strategies for early detection are successful and lung cancers 
are detected at a stage where local tumor resection and treatment is curative, 
these patients will still be at significant risk for developing second primary tumors 

Address for correspondence: Dr. Walter N. Hittelraan, Department of Experimental Thera- 
peutics, The University of Texas M D. Anderson Cancer Center, 1515 Holcombe Blvd. (Box 
19), Houston, Texas 77030. Voice: 713-792-2961; fax: 713-792-3754. 

whittelm@mdanderson.org 
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associated with the problem of field cancerization. 2 Similarly, for individuals with a 
first head and neck primary tumor, even if the first malignancy is successfully treat- 
ed, the risk of developing a second primary in the tobacco smoke-exposed field is 
approximately 40%. 3 Similar cancer risk estimates exist for individuals who exhibit 
severe dysplasia in premalignant epithelial lesions. 4 For these reasons, it is important 
tb focus on chemopreventive strategies to_prevent thedevelopnient of epithelial 
malignancies. 

. Several problems confront chemoprevention trials designed to identify effica- 
cious agents. 5 First, chemoprevention trials with cancer incidence as a primary end- 
point require tens of thousands of subjects and tens of years of intervention and 
follOw-up for statistical evaluation. For example, a recently reported trial involved 
30,000 subjects and required 10 years in order to examine the impact of prevention 
strategies on lung cancer development, only to find a possible increased lung cancer 
incidence in current smokers who received ^-carotene. 6 

The problem of large, long-term trials results from the difficulty in identifying 
individuals at highest cancer risk who might best benefit from chemopreventive 
intervention. For example, 20 pack-year smokers, while known to be at relatively 
increased risk for developing lung cancer, have approximately a 10% lifetime risk 
for developing lung cancer/ This seriously limits the number of potentially useful 
Strateg i e s tha t ra n hp . r . linirnlly PTplnrpd A sftfinnd prnhlpm fari ng r hftirinpmvftntinp 
trials is that little is known about what agents are likely to have efficacy, and even 
less is known regarding proper doses, schedules, and durations of treatment Part of 
the reason for this problem is that too little is known about the physiologic processes 
that drive epithelial cancer development 

In order to reduce the number of subjects and the time required to carry out 
chemoprevention trials and thus allow the exploration of multiple prevention strate- 
gies, two types of advances are necessary. First, it is important to identify individuals 
at significantly increased cancer risk who might best benefit from different types of 
intervention. Second, in order to allow the rapid identification of agents, doses, and 
schedules of potentially efficacious agents, it is necessary to identify and validate 
surrogate endpoints of response that indicate whether the agents are having a posi- 
tive impact on the target tissue during the chemopreventive intervention. 

One approach to identifying individuals at increased aerodigestive tract cancer 
risk is to explore epidemiologic features of potential subjects. Molecular epidemio- 
logic studies are beginning to identify intrinsic host factors that place some individ- 
uals at increased cancer risk, especially those with a chronic smoking history. 8 Most 
intrinsic factors identified thus far reflect levels of carcinogen metabolism, repair 
capabilities of the host following DNA damage, and other measures of intrinsic 
cellular sensitivity to mutagens. Whilelhese factors can provide statistically signif- 
. icant risk ratios in case-control studies that are controlled for tobacco exposure, the 
detected risk ratios usually fall in the range of 1.5 to 10. Unfortunately, this is not 
sufficient for the individualization of treatment and is not sufficiently high tb signif- 
icantly reduce the numbers of subjects required for chemoprevention trials with 
cancer incidence as the primary endpoint. 

Another approach to identifying individuals at increased cancer risk is to directly 
examine the target tissue of individuals with known carcinogen exposure (e.g., 
chronic tobacco smoke exposure), who have evidence of target organ dysfunction 
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(e.g., chronic obstructive pulmonary disease, changes in voice quality), or who 
have clinical evidence of premalignancy (e.g., bronchial metaplasia/dysplasia, oral 
leukoplalria/erythroplakia, cervical intraepithelial neoplasia). The conventional 
standard for assessing cancer risk in these situations is the degree of histological 
change. However, while individuals who show moderate to severe dysplasia are 
known to be at increased cancer risk when compared to individuals with lesser his- 
tologic changes, it is often difficult to distinguish reactive changes to carcinogenic 
insult from initiated and progressing lesions. Similarly, upon cessation of carcino- 
genic insult, histologic changes may reverse yet cancer risk may continue for many 
years. For example, while smoking cessation is associated with decreased bronchial 
metaplasia; 9 increased lung cancer risk continues for many years beyond smoking 
cessation. 10 In fact, nearly half the newly diagnosed lung cancer cases in the USA 
occur in former smokers. 

The development of assays to identify individuals at high epithelial cancer risk 
and to directly assess response to intervention in the target tissue is therefore an 
important research goal. Such assays should be objective and easily quantifiable and, 
if possible, minimally invasive. Moreover, they should reflect both the disease pro- 
cess and the targeted pathway and thereby be useful in assessing risk and monitoring 
response to intervention as well as directly testing the hypothesized mechanism of 
action- of the ch emopr eventiv e strategy. 

1° the chemoprevention setting it is important to recognize that one does not 
know the location of the future cancer. Thus, assays must necessarily be carried out 
on random biopsies of the field at risk. Even if there are clinically evident premalig- 
nant lesions, this does not mean that this is the likely site for a future malignancy. 
For example, nearly half of the cancers that develop in individuals with oral leuko- 
plakia arise away from the original index lesion. Similarly, since many newly diag- 
nosed lung cancers arise in the peripheral parts of the lung (e.g., adenocarcinomas), 
especially in former smokers, and since endobronchoscopy predominantly accesses 
central components of the lung, it is important to identity biomarkers that can reflect 
global processes ongoing in the target epithelial field associated with increased can- 
cer risk. Their discovery requires a better understanding of the tumorigenesis pro- 
cess in epithelial fields at cancer risk. 



THE RATIONALE FOR STUDYING 
GENOMIC INSTABILITY AS A MARKER OF RISK 

-^TYimors'of roeaerodigestive tract have been proposed tbl^flecfa "field canaliza- 
tion" process whereby the whole tissue is exposed to carcinogenic insult (e.g., tob- 
acco smoke) arid is at increased risk for multistep tumor development. 12 ' 13 Several 
types of clinical and laboratory data support this notion, including the frequent 
occurrence of synchronous primary and subsequent second primary tumors in the 
aerodigestive tract (frequently exhibiting dissimilar histologies as well as distinct 
genetic signatures 14 - 16 ) and the presence of premaiignant lesions that precede and/or 
accompany the tumor in the exposed tissue field. 17 The notion of a multistep tumor- 
igenesis process is further supported by serial clinical and histologic evaluations of 
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target tissue or exfoliated cells where increasing degrees of histological abnormali- 
ties are observed over time. 18 

A working mode] for aerodigestive tract tumorigenesis is illustrated in Figure 1. 
lumorifeenesis in the face of carcinogenic exposure likely involves a chronic process 
of tissue injury and wound healing. DNA damage induced by the carcinogen is likely 
fixed into permanent genetic changes (e.g., chromosome damage, chromosome non- 
disjunction, gene mutation, gene deletion, etc.) during the process of proliferation. 
This damage would be expected to be distributed throughout the exposed tissue field 
leading to a background of generalized genomic damage (depicted in Figure 1 as a 
background mat of increasing density). Chronic injury and repair likely leads to the 
accumulation of cells with increasing amounts of genetic changes as welt as the out- 
growth of abnormal clones (triangles in Figure 1) carrying an accumulation of 
genetic changes important for selective survival, dysregulated growth, and preferen- 
tial epithelial take-over by initiated clones (see Figure 2). 

Cellular and molecular evidence for the field carcinogenesis and multistep tum- 
origenesis model comes from many laboratories. 19 * 20 With the advent of a wide array 
of molecular technologies, a large number of specific molecular genetic and epige- 
netic changes involving specific oncogenes, tumor suppressor genes, cell regulatory 
genes, and repair genes have now been described for aerodigestive tract cancers. The 
i de n t i fica ti o n o f these spe ci fic mol ecular changes have now provided probes to 
explore specific events occurring in premalignant lesions adjacent to aerodigestive 
tract tumors. 21-24 Frequently, these premalignant lesions showed a subset of the 
same molecular changes found in the associated tumor, suggesting that these lesions 
might represent precursor lesions for the associated tumors (i.e., a manifestation of 
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FIGURE 1. Field cancerization and multistep tumorigenesis. 
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FIGURE 2. Multiple focal clonal evolution during multistep tumorigenesis. 



a multistep tumorigenesis process). For example, studies of the premalignant lesions 
adjacent to head and neck tumors have provided evidence for a gradual accumulation 
of genetic alterations accompanied by evidence for dysregulation of cellular control 
mechanisms (e.g., alterations in expression of PCNA, EGFR, TGF-fl, p53, and 
cyclinDl). 25 - 28 

These types of studies have now also been applied to the target epithelium of indi- 
viduals at increased risk for aerodigestive tract cancer (i.e., individuals with a chron- 
ic smoking/alcohol history and/or prior aerodigestive tract cancer). Several groups 
(using polymerase chain reaction, PCR, analysis of microdissected epithelium) have 
now demonstrated the presence of clonal outgrowths in the target premalignant epi- 
thelium of individuals at increased risk for cancer. 29 " 31 For example, exarnination of 
bronchia] biopsies derived from individuals with a 20 pack-year smoking history 
demonstrated that 76% of the cases showed evidence for LOH (3pl4, 9p21, or 
17pl3) in at least one of six lung biopsy sites. On a per site basis, some form of LOH 
was observed in 25% of the sites examined. 29 

If aerodigestive tract cancer development reflects a field cancerization process 
"involving multistep events7then rislfand response informafioTsfiould beablelo be" 
derived from random biopsies or exfoliated cells from the field at risk or from assess- 
ments of tissue undergoing similar processes. Hypothetically, lesions exhibiting the 
greatest degree of genomic instability, clonal outgrowth, and abnormal epithelial 
regulation would be at the highest relative aerodigestive tract cancer risk. Similarly, 
an active chemopreventive intervention might be expected to decrease these mani- 
festations of risk. Reduced risk manifestations include decreased levels of ongoing 
genetic instability, decreased frequency of clonal outgrowths, and increased epithe- 
lial growth regulation. 
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THE MEASUREMENT OF CHROMOSOME INSTABILITY USING 
CHROMOSOME IN SITU HYBRIDIZATION 

Molecular genetic techniques, while extremely useful for detecting clonal chang- 
es in targets tissues, are somewhat limited in their ability to detect random genetic 
instability. Conventional cytogenetic assays are useful for detecting chromosome 
instability and clonal chromosome changes. However, they require numbers of 
dividing cells for karyotypic analysis that are difficult to attain in the setting of biop- 
sies acquired during the course of a chemoprevention trial. A technique was there- 
fore needed that would allow chromosome instability measurements in situations 
where few cells are available (e.g. small biopsies, brushings, or sputum samples) and 
where the target material might be fixed. It was also desirable to have a technique 
that would be adaptable to tissue sections, whereby spatial information could be 
retained and genotype/phenotype associations could be determined on the same or 
adjacent tissue sections. The technique of in situ hybridization (ISH) involves the 
use of DMA probes that recognize either chromosome-specific repetitive target 
sequences, chromosome single gene copy sequences, or sequences along the whole 
chromosome length or chromosome segments. 32 We have adapted the ISH technique 
for formalin-fixed, paraffin-embedded tissue sections and have applied it to a variety 
of tissues, including the aerodigestive tract 33 * 34 • 

Using probes that label the centromere regions of specific chromosomes, this 
assay permits determination of the average chromosome number per cell for each 
specimen. This assay is also useful for detecting generalized chromosome instability 
during the tumorigenesis process. Normal diploid populations should have two cop- 
ies of each autosomal chromosome and should rarely show three or more chromo- 
some copies per cell (chromosome polysomy), especially in tissue sections where 
nuclear truncation results in an under-representation of chromosome copy number. 
Thus, die detection of cells with three or more chromosome copies would indicate 
the presence of chromosome instability. 

To examine this technique's potential for characterizing the multistep tumorigen- 
esis process in the aerodigestive tract, we measured the fraction of cells exhibiting 
three or more chromosome copies in apparently contiguous epithelial transitions 
from normal to hyperplastic to dysplastic to carcinomas, all on a single tissue slice 
of head and neck squamous cell carcinomas. 34 In these specimens, greater than 35% 
of the cases of adjacent "normal" epithelium, greater than 65% of the cases of hyper- 
plastic epithelium, and greater than 95% of the dysplastic and tumor regions showed 
evidence of chromosome polysomy. Of interest, similar transitions of chromosome 
instability were observed with at least four different chromosome probes. Similar 
trends hav e also bee n observed in amenabl e tissue from other epithelial mali g nan- 
cies, including cervix, bladder, and breast 3 * These results thus suggested that the 
notions of field cancerization and multistep tumorigenesis might apply to several 
epithelial tissues and that measures of chromosome instability might be useful for 
monitoring this process. 

In the situations described above, the premalignant lesions examined might be 
considered to represent epitheUum at 100% risk of being in a cancer field, since they 
were located in the adjacent epithelium to the cancer. This then raises the question 
of the nature of genetic instability in the epithelium of individuals at increased risk 
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for developing cancer. To explore this issue, we obtained biopsies daring the course 
of leukoplakia cheraoprevention trials exploring the use of 13-ci>retinoic add in 
reversing leukoplakia and probed them for genetic instability using in situ hybridiza- 
tion. In one retrospective study and in one prospective study of subjects with oral 
leukoplakia, the results indicate that those subjects whose pretreatment biopsies har- 
bor relatively high levels of genomic instability (i.e., more than 3% of the cells 
examined showing at least 3 chromosome 9 copies per cell) have a significantly 
higher likelihood 6f suffering early onset of head and neck cancer. 36,37 Interestingly, 
half of the tumors that did develop occurred away from the biopsy site used to mea- 
sure genetic instability. This result suggests that genomic instability measurements 
in carcinogen-exposed tissue can provide useful cancer risk estimates. 



THE RELATIONSHIP BETWEEN TOBACCO EXPOSURE AND 
CHROMOSOME INSTABILITY 

In recent years, the aerodigestive tract chemoprevention group at M.D. Anderson 
Cancer Center has initiated three sequential biomarker-associated chemoprevention 
trials involving chronic smokers with a greater than 20 pack-year smoking history. 
^n-eaeh-ofrhese^ta&&s r endobronchial4>^ 

within the lung, including the carina and at bifurcation points at the upper, middle, 
and lower right lung and at the upper and lower left lung. Biopsies were obtained pri- 
or to and following chemopreventive intervention and were subjected to in situ 
hybridization analysis in addition to analyses for other biomarkers. The first impor- 
tant finding was that some degree of chromosome polysoray was evident in all lung 
sites examined, and this was observed independently of the particular chromosome 
probe utilized. 38 This finding supports the notion that random chromosome changes 
may be occurring throughout the exposed lung field. 

In a second study, bronchial biopsies were obtained from individuals with a 20 
pack-year smoking history. In this study, most of the subjects involved were current 
smokers. 39 Interestingly, all cases who showed metaplasia at one of six biopsy sites 
also showed chromosome polysomy in at least one biopsy site; overall, 88% of the 
sites showed some evidence of chromosome 9 polysomy. 40 Evidence for genetic 
instability was also detected in patients who did not show evidence of bronchial 
metaplasia in any of six biopsy sites despite a strong smoking history. In fact, more 
than 90% of the cases and more than 60% of the sites showed significant chromo- 
some polysomy (i.e., at least three copies in at least 2 % of the cells examined). 
These results suggest that the lungs of long-term smokers show significant evidence 
of g enetic instabi lity, and this i nstability can be detected throughout the accessible 
bronchial tree, even when bronchial metaplasia is not evident. . 

These studies in current smokers has allowed us to examine me relationship 
between the levels of genetic instability detected and subject characteristics such as 
smoking status (current or former), smoking history, and lung tissue pathologic 
changes. Evaluable biopsy material has now been obtained from more than 108 cur- 
rent smokers, including more than 480 evaluable biopsy sites. The mean metaplasia 
index in these current smokers was 30.4%. For the total population studied, the 
median chromosome index for the bronchial biopsies was 1.41 (range, 1.04-1.61) 
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and the median chromosome polysomy index was 2.0% (range 0-8.7%). This can be 
compared to a mean chromosome index between 1.2-1.4 for lymphocytes and very 
fare chroraosome polysomy. Interestingly, the intrasubject variability in chromo- 
some instability was relatively low: in most subjects and was less than the intersub- 
ject variability. Tiiese results suggested that chronic smokers harbor detectable 
chromosome instability throughout the accessible bronchial tree (supporting the 
field carcinogenesis notion) and that information from one biopsy site might yield 
representative information for me rest of me lung field. 

Since most of the current smokers exhibited bronchial metaplasia in at least one 
of the biopsied sites, this allowed us to examine the relationship between chromo- 
some instability and histologic changes, both on a site-by-site basis and on a per case 
basis. On a site-by-site basis, the chromosome indices of lesions showing squamous 
metaplasia were similar to those not showing metaplasia (i.e., median 1 .43 vs. 1 .43), 
and the degree of chromosome polysomy in metaplastic lesions were only slightly 
higher than in non-metaplastic sites (medians: 2.2% vs. 1.8%, respectively). Thus, 
the presence or absence of squamous metaplasia at a biopsy site does not necessarily 
correlate with the degree of underlying genomic instability. On the other hand, those 
subjects with metaplasia indices of at least 15% also showed higher levels of chro- 
mosome polysomy than did subjects with metaplasia index below 15% (medians: 
2A% VS, 1.8%, P = 0.005). Thus, these chromosome instability assessments in cur- 
rent smokers appeared to reflect a more global process in the lung field. 

Tobacco exposure has been shown to significantly increase the risk of developing 
lung cancer, and the degree of risk is related to the extent of tobacco exposure. We 
were interested in determining the relationship between ^dividuals* smoking histo- 
ry parameters and the levels of chromosome change found in their lungs following 
years of tobacco exposure. While there was significant intersubject variation for sim- 
ilar tobacco exposure histories, overall there was a significant correlation between 
the degree of chromosome polysomy and the intensity of ongoing tobacco exposure 
(packs/day, p = 0.02 on a per site basis) and with the extent of tobacco exposure 
(pack-years, p = 0.003). Thus the amount of chroraosome polysomy reflects the 
intensity and extent of tobacco exposure. At the same time, individuals with similar 
smoking histories showed widely divergent amounts of chromosome polysomy, pos- 
sibly reflecting differences in intrinsic sensitivity between subjects. There was. also 
strong correlation between the chromosome index and the duration of the smoking 
history (smoking years) and total accumulated exposure (pack-years, p = 0.0001). 
These results suggest that tobacco exposure is associated with the initiation and 
accumulation of chromosome instability in the exposed lung; however individuals 
are differentially sensitive to carcinogenic insult The working hypothesis is that 
those individuals who accumulate the hi g hest d egree of chromosome changes will 
be at the highest lung cancer risk. 

Many of the bronchial biopsies from chronic smokers examined by in situ hybrid- 
ization showed a rise in the chromosome index above that expected for a diploid cell 
population, especially in subjects with an extensive smoking history. The rise in 
chromosome index was also accompanied by an increase in the fraction of cells 
exhibiting at least 3 chromosome copies per cell. To deterrnine if a rise in the tissue 
chromosome index was due to clonal expansion of populations with chromosome tri- 
somy, the chromosome copy number and relative coordinates of each cell scored in 
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the bronchial epithelium was recorded and a spatial genetic map was created. 41 We 
then developed algorithms for calculating localized chromosome indices within the 
tissue. Since trisomic clones would have, on average, three chromosomes instead of 
two, those cells involved in neighborhoods with chromosome indices three-halves 
that of diploid populations could be marked as being part of a trisomic clone. Simi- 
. larly. groups of cells with chromosome indices half that of diploid populations could 
be marked as being part of a monosomic clone. This allowed the generation of a sec- 
ond-order, two-dimensional genetic map representation of the bronchial epithelium 
showing the relative locations of cells involved in monosomic and trisomic clonal 
outgrowths. When adjacent tissue sections from the same bronchial biopsy were 
probed separately for different chromosomes, the detected clones appeared to occu- 
py separate subregions of the epithelium. This result suggests that not only are the 
lungs of chronic smokers undergoing a process of genetic instability, they are expe- 
riencing the outgrowth of multiple clones throughout the exposed lung field, as pos- 
tulated by the models shown in Figures 1 and 2. One advantage of this clonal 
approach is that the contribution of both monosomic and multisomic clones can be 
detected. 

Since smoking cessation has been suggested to reduce the lung cancer risk, it was 
of interest to determine whether the levels of chromosome instability would decrease 
following smoking cessation. This question was possible to examine because our 
third sequential chemoprevention trial involved subjects who had discontinued 
smoking. So far, more than 220 subjects (more than 650 biopsies) who have quit 
smoking (mean 9.9 quit-years) have been evaluated for chromosome instability in 
their lungs. Despite the fact that the mean metaplasia index in this group is 5.8% 
(considerably less than that in current smokers), chromosome instability is still 
observed in the majority of subjects. 42 While the mean chromosome polysomy level 
is reduced to 1.0%, some individuals continue to show polysomy levels above 5%. 
Interestingly, while the overall chromosome polysomy levels were reduced in these 
individuals who stopped smoking, the mean chromosome index remained at about 
1 .4 with some individuals exhibiting chromosome indices as high as 1.8. Initial chro- 
mosome mapping studies suggest that while random chromosome instability seems 
to decrease following smoking cessation, the clonal outgrowths may remain for 
many years in the lung. The working hypothesis is that those individuals who show 
the greatest degree of remaining chromosome instability are at the highest lung can- 
cer risk despite smoking cessation. Long-term follow-up on these subjects will be 
necessary to test this hypothesis. 



SUMMARY AND CONCLUSIONS 

Aerodigestive tract tumorigeftesis appears to be a multistep process taking place 
throughout the tissue fieids of exposure. When viewed in the context of chromosome 
changes, carcinogen exposure appears to be associated with the random acquisition 
Of chromosome polysomy throughout the exposed field, the degree of which is relat- 
ed to the degree and extent of carcinogen exposure as well as to the instrinsic suscep- 
tibility of the exposed individual. Continued exposure leads to continued acquisition 
of new changes and, in association with chronic wound-healing processes, to the 
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accumulation of clonal outgrowths throughout the target tissue. Although the ulti- 
mate malignancy may occur in only one or few tissue sites, manifestations of the 
instability process that drives tumorigenesis is globally present in the tissue. Thus 
random; biopsies may provide useful risk information for the exposed Meld as a 
whole. Even when carcinogen exposure is reduced or chemopreventive strategies are 
initiated and histologic manifestations of the tumorigenesis process subside, the 
genetic scars of prior exposure remain in the form of clonal outgrowths and may 
explain continued lung cancer risk in ex-smokers. Future chemoprevention strategies 
need to focus on reducing the degree of chromosome instability and on trying to 
eliminate residual abnormal clonal outgrowths in the aerodigestive tract In this set- 
ting, the measurement of chromosome instability in the target tissue will be useful in 
assessing cancer risk as well as response to intervention. 



The studies reviewed here represent one component of the collaborative efforts 
of the Aerodigestive Tract Chemoprevention team at The University of Texas M.D. 
Anderson Cancer Center, Houston, Texas. The studies were supported in part by 
^tienatfnstitotes<>fHealth^tien^ 

CA 79437, CA 16672, CA 68089, CN 25433, CA 86390, CA 70907, NTH DE 13157, 
and the State of Texas Tobacco Research Fund. 
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42. HnTBLMAN, W.N., J J. Lee, J.S. Leb, et al. 1998. Persistent genetic instability despite 

decreased proliferation in human lung -tissue following smoking cessation. Proc. 
AACR 39: 336. 
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DECLARATION OF AUDREY D. GODDARD. Ph.D UNDER 37 C.F.R. § 1.132 



Assistant Commissioner of Patents 
Washington, D.C. 2023 1 

Sir: 

1, Audrey D. Goddard, Ph.D. do hereby declare and say as follows: 

1 . I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2. Between 1 993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to and 

forms part of this Declaration (Exhibit A). 
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Serial No.: * 
Filed: * 

4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (i.e., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et al, Biotechnology 10:413-417 (1992) (Exhibit B); Livak et al., PCR 
Methods Appl., 4:357-362 (1995) (Exhibit C) and Heid et al., Genome Res. 6:986-994 (1996) 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et al., Proc, 
Natl Acad. Sci. USA 95(25): 147 17- 14722 (1998) (Exhibit E); Pitti et al, Nature 
396(67 12):699-703 (1998) (Exhibit F) and Bieche et g/.. Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et al. have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et al. studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et al. used the assay to study gene amplification in breast cancer. 
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Serial No.:* 
Filed:* 

7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown* 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 1 8 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 




Date 



Audrey D. Goddard, Ph.D. 
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AUDREY D. GODDARD, Ph.D. 



Genentech, Inc. 
1 DNA Way 

South San Francisco, CA, 94080 

650.225.6429 

goddarda@gene.com 



110 Congo St. 

San Francisco. CA, 94131 

415.841.9154 

415.819.2247 (mobile) 

agoddard@pacbell.net 



PROFESSIONAL EXPERIENCE 

« * u 1993-present 
Genentech, Inc. 

South San Francisco, CA 
2001 - present Senior Clinical Scientist 

Experimental Medicine / BioOncology, Medical Affairs 

Responsibilities: 

• Companion diagnostic oncology products 

. Acquisition of clinical samples from Genentech* clinical tnals for translational research 
. Translational research using clinical specimen and data for drug development and 

. timber of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part 1 1 Subteam 

Interests: . . 

. Ethical and legal implications of experiments with clinical specimens and data 

• Application of pharmacogenomics in clinical trials 

1998-2001 Senior Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 

? Managemenf'of a laboratory of up to nineteen -including postdoctoral fellow, associate 
scientist, senior research associate and research assistants/associate levels 

• Management of a $750K budget 

. DNA sequencing core facility supporting a 350+ person research facility. 

. DNA sequencing for high throughput gene discovery, - ESTs, cDNAs, and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel genes 

. Identification and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to inflammatory diseases 

. Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 
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1993-1998 Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities 

• DNA sequencing core facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into fifteen, introducing a level of middle management and additional areas of research 

• Participated in the development of the basic plan for high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene identification. 

• Design and implementation of analysis tools required for high throughput gene identification. 

• Chromosomal localization of genes encoding novel secreted proteins. 

Research: 

• Genomic sequence scanning for new gene discovery. 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

• Growth hormone receptor gene SNPs in children with Idiopathic short stature 



Imperial Cancer Research Fund 1989-1992 
London, UK with Dr. Ellen Solomon 

6/89-12/92 Postdoctoral Fellow 

• Cloning and characterization of the genes fused at the acute promyelocyte leukemia 
translocation breakpoints on chromosomes 17 and 15. 

• Prepared a successfully funded European Union multi-center grant application 



McMaster University 

Hamilton, Ontario, Canada with Dr. G. D. Sweeney 
5/83 - 8/83: NSERC Summer Student 

• In vitro metabolism of p-naphthoflavone in C57BI/6J and DBA mice 



EDUCATION 



Ph.D. 

"Phenotypic and genotypic effects of mutations in 
the human retinoblastoma gene." 
Supervisor: Dr. R. A. Phillips 

Honours B.Sc 

"The in vitro metabolism of the cytochrome P-448 
inducer p-naphthoflavone in C57BL/6J mice." 
Supervisor: Dr. G. D. Sweeney 



University of Toronto 
Toronto, Ontario, Canada. 
Department of Medical 
Biophysics. 

McMaster University, 
Hamilton, Ontario, Canada. 
Department of Biochemistry 



1989 



1983 
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ACADEMIC AWARDS 

Imperial Cancer Research Fund Postdoctoral Fellowship 1989-1992 

Medical Research Council Studentship 1983-1988 

NSERC Undergraduate Summer Research Award 1983 

Society of Chemical Industry Merit Award (Hons. Biochem.) 1983 

Dr. Harry Lyman Hooker Scholarship 1981-1983 

J.LW. Gill Scholarship 1981-1982 

Business and Professional Women's Club Scholarship 1980-1981 

Wyerhauser Foundation Scholarship 1979-1980 



INVITED PRESENTATIONS 

Genentech's gene discovery pipeline: High throughput identification, cloning and 
characterization of novel genes. Functional Genomics: From Genome to Function, Litchfield 
Park, AZ, USA. October 2000 

High throughput identification, cloning and characterization of novel genes. G2K:Back to 
Science, Advances in Genome Biology and Technology I. Marco Island, FL, USA. February 
2000 

Quality control in DNA Sequencing: The use of Phred and Phrap. Bay Area Sequencing 
Users Meeting, Berkeley, CA, USA. April 1999 

High throughput secreted protein identification and cloning. Tenth International Genome 
Sequencing and Analysis Conference, Miami, FL, USA. September 1998 
The evolution of DNA sequencing: The Genentech perspective. Bay Area Sequencing Users 
Meeting, Berkeley, CA, USA. May 1998 

Partial Growth Hormone Insensitivity: The role of GH-receptor mutations in Idiopathic Short 
Stature. Tenth Annual National Cooperative Growth Study Investigators Meeting, San 
Francisco, CA, USA. October, 1996 

Growth hormone (GH) receptor defects are present in selected children with non-GH-deficient 
short stature: A molecular basis for partial GH-insensitivity. 76 th Annual Meeting of The 
Endocrine Society, Anaheim, CA, USA. June 1994 

A previously uncharacterized gene, myl, is fused to the retinoic acid receptor alpha gene in 
acute promyelocytic leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy. October 1991 
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PATENTS 

Goddard A, Godowski PJ, Gurney AL. NL2 Tie ligand homologue polypeptide. Patent 
Number: 6,455,496. Date of Patent: Sept. 24, 2002. 

Goddard A, Godowski PJ and Gurney AL. NL3 Tie ligand homologue nucleic acids. Patent 
Number: 6,426,218. Date of Patent: July 30, 2002. 

Godowski P Gurney A. Hillan KJ, Botstein D, Goddard A, Roy M, Ferrara N, Tumas D, 
Schwall R. NL4 Tie ligand homologue nucleic acid. Patent Number: 6,4137,770. Date of 
Patent: July 2, 2002. 

Ashkenazi A Fong S, Goddard A, Gurney AL, Napier MA, Tumas D, Wood Wl. Nucleic acid 
encoding A-33 related antigen poly peptides. Patent Number: 6,410,708. Date of Patent:: 
Jun. 25, 2002. 

Botstein DA Cohen RL, Goddard AD, Gurney AL, Hillan KJ, Lawrence DA, Levine AJ, 
Pennica D, Roy MA and Wood Wl. WISP polypeptides and nucleic acids encoding same. 
Patent Number: 6,387,657. Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gurney AL. Tie ligands. Patent Number: 6,372.491. Date of 
Patent: April 16, 2002. 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homologue antibody. Patent 
Number: 6,350,450. Date of Patent: Feb. 26, 2002. 

Fong S Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 
receptor tyrosine kinase ligand homologues. Patent Number: 6,348,351. Date of Patent: 
Feb. 19, 2002. 

Goddard A, Godowski PJ and Gurney AL. Ligand homologues. Patent Number: 6,348,350. 
Date of Patent: Feb. 19, 2002. 

Attie KM Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth 
hormone ' insensitivity syndrome. Patent Number: 6,207,640. Date of Patent: March 27, 
2001. 

Fong S Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Nucleic 
acids encoding NL-3. Patent Number: 6,074.873. Date of Patent: June 1 3. 2000 

Attie K Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,824,642. Date of Patent: October 20, 1998 

Attie K Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,646,1 13. Date of Patent: July 8, 1997 

Multiple additional provisional applications filed 
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PUBLICATIONS 

Seshasayee D, Dowd P, Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ, Goddard A, Grigorescu F, Lautier C, Smith RJ and Chernausek SD. Human 
IGF-1 receptor mutations resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S, Xie, M-H, Foster J. Frantz G, Stinson J, Corpuz RT, Simmons L, Hillan K, 
Yansura DG, Vandlen RL, Goddard AD and Gurney AL. FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript submitted. 

Adams SH, Chui C, Schilbach SL, Yu XX, Goddard AD, Grimaldi JC, Lee J, Dowd P, Colman 
S., Lewin DA. (2001) BFIT, a unique acyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal 360: 135-142. 

Lee J Ho WH. Maruoka M. Corpuz RT. Baldwin DT. Foster JS. Goddard AD. Yansura DG. 
Vandlen RL. Wood Wl. Gurney AL. (2001) IL-17E, a novel proinflammatory ligand for the IL- 
17 receptor homolog IL-17RM. Journal of Biological Chemistry 276(2): 1660-1664. 

Xie M-H, Aggarwal S, Ho W-H, Foster J, Zhang Z, Stinson J, Wood Wl, Goddard AD and 
Gurney AL. (2000) Interleukin (IL)-22, a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2-4 and IL-22R. Journal of Biological Chemistry 275: 
31335-31339. 

Weiss GA, Watanabe CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Natl. Acad. Sci. USA 97: 
8950-8954. 

Guo S, Yamaguchi Y, Schilbach S, Wada T.;Lee J, Goddard A, French D , Handa H, 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal 
development. Nature 408: 366-369. 

Yan M, Wang L-C, Hymowitz SG, Schilbach S, Lee J, Goddard A, de Vos AM, Gao WQ, Dixit 
VM. (2000) Two-amino acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Science 290: 523-527. 

Sehl PD, Tai JTN, Hillan KJ, Brown LA, Goddard A, Yang R, Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A, Wilson SW, Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. Neuron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W, Armanini P, Gurney A, Phillips HS, Brush, J, Goddard 
A, de Sauvage FJ and Rosenthal A. (1999) Characterization of the human suppressor of 
fused; a negative regulator of the zinc-finger transcription factor Gli. J. Cell Sci. 112: 4437- 
4448." 

Xie M-H Holcomb I, Deuel B, Dowd P, Huang A, Vagts A, Foster J, Liang J, Brush J, Gu Q, 
Hillan K,' Goddard A and Gurney, A.L. (1999) FGF-19, a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 11: 729-735. 
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Yan M, Lee J, Schilbach S, Goddard A and Dixit V. (1999) mE10, a novel caspase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL Marsters SA, Huang RM, Pitti RM, Mark DT, Baldwin DT, Gray AM, Dowd P, 
Brush J Heldens S, Schow P, Goddard AD, Wood.WI, Baker KP, Godowski PJ and 
Ashkenazi A. (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR. Current Biology 9(4): 215-218. 

Ridgway JBB Ng E, Kern JA ,Lee J, Brush J, Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 2718-2723. 

Pitti RM Marsters SA, Lawrence DA, Roy M, Kischkel FC, Dowd P, Huang A, Donahue CJ, 
Sherwood SW, Baldwin DT, Godowski PJ. Wood Wl, Gurney AL, Hillan KJ, Cohen RL, 
Goddard AD, Botstein D and Ashkenazi A. (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and colon cancer. Nature 396(6712): 699-703. 
Pennica D Swanson TA. Welsh JW, Roy MA, Lawrence DA, Lee J, Brush J. Taneyhill LA, 
Deuel B Lew M Watanabe C, Cohen RL, Melhem MF, Finley GG, Quirke P. Goddard AD, 
Hillan KJ Gurney AL, Botstein D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors. Proc. Natl. Acad. Sci. USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A, Huang A, Xie MH, Zhang M, Goddard A, Wood Wl, Gurney AL 
and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysacchande-mduced cellular 
signalling. Nature 395(6699): 284-288. 

Merchant AM, Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficient route to human bispecific IgG. Nature Biotechnology 16(7): 677-681. 

Marsters SA, Sheridan JP, Pitti RM, Brush J, Goddard A and Ashkenazi A. (1998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonifas JM, Lam CW, Hynes M, 

Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 

mutations in sporadic basal-cell carcinoma. Nature. 391(6662): 90-92. 

Marsters SA Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J, Gurney A, 

Goddard AD, Godowski P and Ashkenazi A. (1997) A novel receptor for Apo2L/TRAIL 

contains a truncated death domain. Current Biology. 7(1 2): 1 003-1 006. 

Hynes M Stone DM. Dowd M, Pitts-Meek S, Goddard A. Gurney A and Rosenthal A. (1997) 

Control of cell pattern in the neural tube by the zinc finger transcription factor Gli-1. Neuron 

19:15-26. 

Sheridan JP Marsters SA, Pitti RM, Gurney A., Skubatch M. Baldwin D. Ramakrishnan L, 
Gray CL Baker K, Wood Wl, Goddard AD, Godowski P. and Ashkenazi A. (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821. 



Audrey D. Goddard, Ph.D page 7 of 9 



Goddard AD Dowd P, Chernausek S, Geffner M, Gertner J, Hintz R, Hopwood N, Kaplan S, 
Plotnick L Rogol A, Rosenfield R, Saenger P, Mauras N, Hershkopf R. Angulo M and Attie, K. 
(1997) Partial growth hormone insensitivity: The role of growth hormone receptor mutations in 
idiopathic short stature. J. Pediatr. 131: S51-55. 

Klein RD Sherman D, Ho WH, Stone D, Bennett GL, Moffat B, Vandlen R, Simmons L, Gu Q, 
Honqo JA Devaux B, Poulsen K, Armanini M, Nozaki C, Asai N, Goddard A Phillips H 
Henderson CE, Takahashi M and Rosenthal A. (1997) A GPI-linked protein that .nteracts with 
Ret to form a candidate neurturin receptor. Nature. 387(6634): 717-21. 
Stone DM, Hynes M, Armanini M, Swanson TA, Gu Q, Johnson RL, Scott I MP Pennica D, 
Goddard A, Phillips H, Noll M, Hooper JE, de Sauvage F and Rosenthal A^ (1996) The 
tumour-suppressor gene patched encodes a candidate receptor for Son.c hedgehog. Nature 
384(6605): 129-34. 

Marsters SA. Sheridan JP, Donahue CJ, Pitti RM, Gray CL, Goddard AD, Bauer KD and 
Ashkenazi A (1 996) Apo-3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and activates apoptosis and NF-kappa p. Current Biology 6(12): 
1669-76. 

Rothe M Xiong J, Shu HB, Williamson K, Goddard A and Goeddel DV. (1996) l-TRAF is a 
^veNRAF-in?eracting protein that regulates TRAF-mediated signal transduction. Proc. Natl. 
Acad. Sci. USA 93: 8241-8246. 

Yang M Luoh SM, Goddard A, Reilly D, Henzel W and Bass S. (1996) The bglX gene 
located at 47.8 min on the Escherichia coli chromosome encodes a periplasmic beta- 
glucosidase. Microbiology 142: 1659-65. 

Goddard AD and Black DM. (1996) Familial Cancer in Molecular Endocrinology of Cancer. 
Waxman, J. Ed. Cambridge University Press, Cambridge UK, pp.187-215. 
Treanor JJS, Goodman L, de Sauvage F, Stone DM. Poulson KT, Beck CD. Gray C Armanini 
MP Po Hocks RA. Hefti F, Phillips HS. Goddard A, Moore MW, Buj-Bello A. Davis AM Asa. N, 
TTkahashi M. Vandlen R, Henderson CE and Rosenthal A. (1996) Charactenzation of a 
receptor for GDNF. Nature 382: 80-83. 

Klein RD Gu Q Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteins and receptors. Proc. Natl. Acad. Sci. USA 93: 7108-7113. 

Winslow JW Moran P, Valverde J, Shih A, Yuan JQ, Wong SC. Tsai SP, Goddard A Henzel 
w7h^ Find Caras I. (1995) Cloning of AL-1. a ligand for an Eph-related tyros.ne kmase 
receptor involved in axon bundle formation. Neuron 14: 973-981 . 

Bennett BD Zeigler-FC, Gu Q. Fendly B. Goddard AD. Gillett N and Matthews W. (1995) 
Molecular cloning of a ligand for the EPH-related receptor protein-tyrosine kinase Htk. Proc. 
Natl. Acad. Sci. USA 92: 1 866-1 870. 

Huanq X Yuang J. Goddard A, Foulis A. James RF, Lernmark A. Pujol-Borrell R, 
Rab"noviich A. Somoza N and Stewart TA. (1995) Interferon expression in the pancreases of 
patients with type I diabetes. Diabetes 44: 658-664. 

Goddard AD Yuan JQ, Fairbairn L, Dexter M. Borrow J, Kozak C and Solomon E. (1995) 
Cloning of the murine homolog of the leukemia-associated PML gene. Mammaban Genome 
6: 732-737. 
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Goddard AD, Covello R, Luoh SM, Clackson T, Attie KM, Gesundheit N, Rundle AC, Wells 
JA, Carlsson LMTI and The Growth Hormone Insensitivity Study Group. (1995) Mutations of 
the growth hormone receptor in children with idiopathic short stature. N. Engl. J. Med. 333: 
1093-1098. 

Kuo SS, Moran P, Gripp J, Armanini M, Phillips HS, Goddard A and Caras IW. (1994) 
Identification and characterization of Batk, a predominantly brain-specific non-receptor protein 
tyrosine kinase related to Csk. J. Neurosci. Res. 38: 705-715. 

Mark MR, Scadden DT, Wang Z, Gu Q, Goddard A and Godowski PJ. (1994) Rse, a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels in the 
brain. Journal of Biological Chemistry 269: 10720-10728. 

Borrow J, Shipley J, Howe K, Kiely F, Goddard A, Sheer D, Srivastava A, Antony AC, 
Floretos T, Mitelman F and Solomon E. (1994) Molecular analysis of simple variant 
translocations in acute promyelocytic leukemia. Genes Chromosomes Cancer 9: 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet. 21: 321-376. 

Borrow J Goddard AD, Gibbons B, Katz F, Swirsky D, Fioretos T, Dube I, Winfield DA, 
Kingston J, Hagemeijer A, Rees JKH, Lister AT and Solomon E. (1992) Diagnosis of acute 
promyelocytic leukemia by RT-PCR: Detection of PML-RARA and RARA-PML fusion 
transcripts. Br. J. Haematol. 82: 529-540. 

Goddard AD, Borrow J and Solomon E. (1992) A previously uncharacterized gene, PML, is 
fused to the retinoic acid receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
Suppl3: 117S-119S. 

Zhu X, Dunn JM, Goddard AD, Squire JA, Becker A, Phillips RA and Gallie BL. (1992) 
Mechanisms of loss of heterozygosity in retinoblastoma. Cytogenet. Cell. Genet. 59: 248-252. 

Foulkes W. Goddard A. and Patel K. (1 991 ) Retinoblastoma linked with Seascale [letter]. 
British Med. J. 302: 409. 

Goddard AD, Borrow J, Freemont PS and Solomon E. (1991) Characterization of a novel zinc 
finger gene disrupted by the t(15;17) in acute promyelocytic leukemia. Science 254: 1371- 
1374. 

Solomon E, Borrow J and Goddard AD. (1991) Chromosomal aberrations in cancer. Science 
254:1153-1160. 

Pajunen L, Jones TA, Goddard A, Sheer D, Solomon E, Pihlajaniemi T and Kivirikko Kl. 
(1991) Regional assignment of the human gene coding for a multifunctional peptide (P4HB) 
acting as the p-subunit of prolyl-4-hydroxylase and the enzyme protein disulfide isomerase to 
17q25. Cytogenet. Cell. Genet. 56: 165-168. 

Borrow J, Black DM, Goddard AD, Yagle MK, Frischauf A.-M and Solomon E. (1991) 
Construction and regional localization of a Not\ linking library from human chromosome 17q. 
Genomics 10: 477-480. 

Borrow J, Goddard AD, Sheer D and Solomon E. (1990) Molecular analysis of acute 
promyelocytic leukemia breakpoint cluster region on chromosome 17. Science 249: 1577- 
1580. 
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Myers JC Jones TA, Pohjolainen E-R, Kadri AS, Goddard AD, Sheer D, Solomon E and 
Pihlaianie'mi T. (1990) Molecular cloning of 5(IV) collagen and assignment of the gene to the 
region of the region of the X-chromosome containing the Alport Syndrome locus. Am. J. Hum. 
Genet. 46: 1024-1033. 

Gallie BL, Squire JA, Goddard A, Dunn JM, Canton M, Hinton D, Zhu X and Phillips RA. 
(1990) Mechanisms of oncogenesis in retinoblastoma. Lab. Invest. 62: 394-408. 
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SIMULTANEOUS AMPLIFICATION AND DETECTION ( 
SPECIFK DNA S1QUIHCES 

« ««H wi«,,rhi* Gavin Dollincer 1 , P. Sean Walsh and Robert Griffith 

'"Corresponding author. 



We have enhanced the polymerase chain 
reaction (PGR) such that specific DNA 
sequences can be detected without open- 
ing the reaction tube. This enhancement 
requires the addition of ethidium bromide 
(EtBr) to a VCR. Since the fluorescence of 
EtBr increases in the presence of double- 
stranded (ds) DNA an increase in fluores- 
cence in such a PCR indicates a positive 
amplification, wbicb can be easily moni- 
tored externally. In fact, amplification can 
be continuously monitored in order to 
follow its progress. The ability to simulta- 
neously amplify specific DNA sequences 
and detect the product of the amplification 
both simplifies and improves PCR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiring high sample through- 
put 



'carryover" false positives in subsequent testing . 

These downstream processing steps would be 
rated if specific amplification and detection of anrptrfied 



if— f 

These ^downstream processing steps wwW be ela- 
ted tf specific amplification and detection of -ampUbcd 
DNA took place simultaneously within an unopened re- 
action vessel Assays m which $uch different processes take 



aCtlOn VCBSCi. rtiNlJ* ma « ww* --- c 

place without the need to separate reaction components 
have been termed 'tomogeheous''. No truly homoge- 
neous PCR assay has been demonstrated to date, although 
progrtss towards this end has been reported, Chenab, et 
aj.™ developed a PCR product detection scheme using 
fluorescent primers that resulted in a fluorescent PCR 
product AlWe-apeeific primers. each with different B«w- 1 
Tesvsent tags, wcrcuscd to indicate the pew 
DNA. However, the unincorporated primers must still Oe 
removed in a down««am process m order « vBuah^ «he 
result Recently, Holland, et al. 1 '. developed 
iwhich the endogenous 5' <xonudease assay of Taq DNA 

ude y p«*e. The prtbe would only cfcave if PCR a mp5ft- 
cation had produced its compUmentary to 
order to detect toe ck^^prod"^. however, a svbsr,- 

q "wc hE^de^elo^da truly homogenTOm a^ay forPCR 
and PCR product detecdoo based upon tbc gready m- 
S^ fluorVsoTru: that ethidiuo btontide and otter 
DNA binding dyes exhibit when tney are bound tcvds- 
DNA l ^' e . As outhncd in Figure h a prototype PCR 



Although the potential benefits of PCR todn> 
kal dLgncsdcs «rc v«tl known 2 - 5 , it is sujl not 
widely used in this setting, even though h is 
four year* iiuco thcrnKwtaW? DNA poV"^* 
mw* msde PCR practical. Some of the reasons for «s slow 
acceptance are high cost, lack of automation of pre- and 
l^t PCR processing steps, and fa^ pos-bve resultt^om 
carrovCT<©ntanur!tioD. The first two points are related 
K Uoori* the largest contributor to cost* die present 
itage of PCR development. Most Current assays rcqu«« 
some form of "downstream- processing once *ermocy. 

is Sne in order xo detinnine whether R the targ« 
DNA seauence was present and has amplified. T*e*e 
fncl^NAhybridi,W*. S-^efec^p^wuhor 
witbout use ofrUi«iondtgesrjon^HPLC*,or capdlary 
rlecnophoresis 10 . These methods arc labor -intense, have 
lowlhrChpuU and arc diBkult to automate. The third 
point is abocloscty related to dovnst^am processing. 
The handling of the PCR product m 
processes increases the chances that amplified DNA wtH 
spread through the typing lab, resulting .u a risk of 
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ttrgft W^tK*** 1 
(apio^imoumiO 



oDi^A primer* 




cobtailtiug 
ratgp» stquencc 



aaONAfCRporftttt 
(yp w ccveid jig) 

HWU 1 Principle of sirmHtancoua amptificalion and detection of 
PCR producL the exponent* of a 1*^0^^ EiBr^t aro 
flitor^cent are listed-f<Br itself, Et»r bound toc^oDNA or 
dsPN A. There a large fbwrcscencc cnh^ccmciii jhcnttBr is 
bound to 0NA and iSndihg w gvcBtly enhanced jrhen DNAjS 
double-stTandcd, After sut&dcaL <n) . cycks ^jf 0 ^ 
increase in d.ipNA results in additional EtBr binding, and ? net 
incrcsmc in total fluamsccncc 
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Gel electrophoresis of PGR amplification products of (he 
human, nuclear gene, HLA DQtt, made in the presence of 
increasing amounts of EtBr (up to 8 M-gtall). The presence of 
tiBr has no obvious effect on she yield or specificity of amplifi- 
cation. 



A. 





HGQU 3 (A) Fluorescence measurement* from PCRs that contain 
0.5 ng/uJ EtBr and thai are specific for Y-chftMifro$0roe repeat 
sequence*. Five replicate PC Ri tvere begun containing each of the 
DNAs specified. At each inclicatcd cyde, one of the five replicate 
PCRs for each DNA was removed from thennocyding and its 
fluorescence measured. Unit* of fluorcaccnce Are artntrarv. (B) 
UV photography of PCR tubes (0,5 ml Eppcndorfctylc, (XHyprO* 
pylcne micro-centrifuge tubes) containing reactions, those start* 
rag from 2 ng male DNA and control reactions without any DMA, 
from (A). 



begins with primers that are single-stranded DNA (as- 
DNA), dNTPs, and DNA polymerase; An amount of 
dsDNA containing the target sequence (target DNA) is 
also typically present. This amount can vary, depending 
on the application, from single-cell amount* of DNA 1T to 
micit)grams per PCR* 8 , If EtBr is present, the reagent* 
that will fluoresce, in order of increasing fluorescence, are 
free EtBr itself, and EtBr bound to the singk^Gtrandcd 
DNA primers and to the double-stranded target DNA (by 
its intercalation between the stacked bases of the DNA 
doubk>hclw). After the first denatu ration cyde, target 
DNA will be largely single-stranded. After a PGR Li 
completed, the most significant change is the ixtcrcase in 
the amount of dsDNA (the PGR product itself) of up to 
several myograms- Formerly free EtBr is bound to the 
additional dsDNA* resulting in an increase in fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to s&DN A 
is much less than to dsDNA, the effect of this change on 
the total fluorescence of the sample is small. The fluores- 
cence increase can be measured by directing excitation 
iUuminaiion through the walls of the amplification vessel 
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before and after, or even continuously during, thermocy. 
ding. 

RESULTS 

PGR in the presence of EtBr. In of der to assess the 
affect of EtBr in PGR, ampEfkations of the human Hl^\ 
DQa gfcne >9 were performed with the dye present at 
concentrations from 0.06 to 8.0 figfml (a typical concen- 
tration of EtBr used in staining of nuciek acids fallowing 
get electrophoresis is 0.5 u-g/inf). As shown in Figure 2, gel 
electtttohorcsis revealed little or no difference in the yield 
or quality of the amplification product whether EtBr was 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not inhibit PCR, 

Detection of human Y-chrt>mowm« specific st* 
aiiences* SequerKfc-spedfic, fluorescence enhancement of 
EtBr ax a result of PCR was demonstrated in a scries of 
amplifications containing 0.5 u-gfal EtBr and primers 
specific to repeat DNA sequences found on the human 
Y-chromosomc^. These PCRs initially contained cither 
60 ng male, 60 ng female, 2 ng mak human or no DNA. 
Plve replicate PCRs were begun for each DNA. After 0, 
17, 21, 24 and 29 cycles of therniocyding, a PCR for each 
DNA was removed from the therraocyder, and its. fluo- 
rescence measured in a spectrcrffoorotneter and plotted 
vs. amplification cyde number (Fig. 3A). The shape of this 
curve reflects the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA is 
becoming linear and not exponential with cyde number; 
As shown, the fluorescence increased about, three-fold 
Over the background fluorescence for the PCRs contain- 
ing human male DNA, but did not significantly increase 
for negative control PCRs, which contained either no 
DNA or human female DNA. The more male DNA 
present to begin with— 60 ng versus 2 ng— the fewer 
. cycles were needed to give a detectable increase in fluo- 
rescence. Od electrophoresis w the products of these 
amplifications showed that DNA fragments of the ex- 
pected size were made in the male DNA containing 
reactions and that Utile DNA synthesis took place in the 
control samples. 

in addition, the increase to. fluorescence wa* visualized 
by simply laying the completed, unopened PCRs on a UV 
transilhimiriator and photographing them through a red 
filter. This te shown in figure SB for the reactions that 
began with 2 ng male DNA and those with no DNA. 

Detection of specific allele* of the human p-glohm 
gene. In order to demonstrate that this approach has 
adequate spedfidty to allow genetic screening, a detection 
of the skkfc-ccil anemia mutation was performed. Figure 
4 shows the fluorescence from completed ampMcations 

containing EtBr (0.5 nQ/ml) a* det*»£d by photography 

of the reaction tubes on a UV tjapsilluminator. These 
reactions were performed using- primers specific for c> 
thcr the wiW-tvpe or sickle-cell mutation of the human 
p^lobin gene* 1 . The spedfidty for each allele is imparted 
by placing the sickle-mutation site at the terminal 3 
nucleotide of one primer. By using an appropriate primer 
annealing temperature, primer extension— and thus an> 
plificaticri^an take place only if the 5' nucleotide of the 
primer is coranlerncntary to the 0-gtobin allele present'' . 

Each pair 01 amplications shown in Figure 4 consists of 
a reaction with either the wild-type allele spedfk (left 
tube) or sicklc-allele specific (right tube) primers. Three 
different DN As were typed: DNA from a homozygous, 
wfld-typc p-globin individual (AA); from a heterozygous 
sickle £~gk>bin individual (AS); and from a homozygous 
sickle p-$obin individual (SS). Each DNA (50 ng genomic 
DNA to start each PGR) was analyzed m triplicate (3 pain 
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rtf reactions each). The DNA .type- va reflmcd tn tbc 
have fluorescence intensities in each pair of completed 
^oltficatkms. There was a significant increase in fluorea- 
only where a ^globin aDele DNA matched the 
orimcr set. When measured on a spcctrcflnoronictcr 
Mat* not shown), this 6uorcsccn.ee was about three tomes 
Kt Present in a PGR where both frglobm alkies were 
^matched to the primer sen Gel ckarophofcfM (not 
Ihown) established that this increase in fluorescence was 
Hue to the synthesis of nearly a microgram of a DNA 
Lament of the expected size for p^lobin. There j was 
Ktdc synihesifi of dsDN A in reacuons m. which the allele- 
nnecinc primer was mismatched to both alleles. 
* ContunHHU wkw^to^ of a PGR. 'Using a fiber optic 
devieer H i* possible to direct excitation illumination from 
n spectrofluorometer to a PGR undergoing thcrmocychng 
and to return its fluorescence to the RpcctroftuorometCT. 
The fluorescence readout of such an arrangement, di- 
rected at an EtBr-containing amplification of Y-chromo- 
some specific sequences from 25 r»g of human male DNA, 
•7 shown in Figure 5. The readout from a control PCR 
wHh no target DNA is also shown. Thirty cycles of PCR 
v ere monitored for each- s s 

The fluorescence trace as a function of time dearly 
shows the effect of the thermocyding. Fluorescence inten- 
tly rises and Calls inversely with temperature. The fluo- 
reiccncc intensity is minimum at the denaturation tern- 
ocrature (94°C) and rowdmum at Ac armeaUngtextenMon 
Snpcrature (50X). In the negalive^control fCR, these 
fluorescence maxima and minima do not change signifi- 
cantly over the thirty tbemocycte, indicating that there is 

Bute cbDNA synthesis !? t ^^- appr %5?^gS 
DNA, and there is little if any Weaeh3ngof E*Br during 
the continuous illumination of the sample* 

In the PCR containing male DNA, the HuoTcscencc 
maxima at the annealing/extension temperature begm to 
increase at about 4000 seconds of therroocycl^g, and 
continue to increase with time, mcUcatuig that dsDNA is 
beta produced at a detectable level. Note that the fluo- 
rescence minima at the denatuxation temperature, do not 
fltemficandy increase, presumably becau^ at this temper 
aturc there is no dsDNA for EtBr to bind. Thus the course 
of the amplification is followed by tracking the fluorc* 
cence increase at the annealing temperature. Analysis or 
the products of these two ampbficadonsb^c^pho- 
rerfs showed a DNA fragment of the «pe*c^re f or the 
male DNA containing sample and no detectable UNA 
synthesis for the control sample. 

DISCUSSION ^ 

Downstream processes such as hybndizanon to a ie> 
oucnee^pedfic probe can enhance die specifiaty of DNA 
deietttuu 1» FCR. The chxbu>atk>« of Uieac procewco 
means that' the specificity of this homogeneous assay 
depends solely on that of FCR. In the case of ?* kl £«J* 
disease, we have shown that PGR ak>oc has sufficient DN A 
sequence specificity to permit genetic screening. Using 
approtjriate amplification conditions, there is bttJc T>on- 
specific production of dsDNA in the absence of the 
appropriate target allele. 

The ^specificity required to dcccct pathogens can be 
more or less than that required' to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with the 
sample. A difficult target is HIV, which requires i detection 
of a viraJ genome that can be at the level of a few copies 
per thoa&nds of host cells* Comoated with geneoc 
screening which is performed on ceils containing at least 
one copy of die target sequence, MV detection requires 
both more specificity and the input of more fctfaJ 
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RmC4 UV photography of PCR tubes containing ampuncattons 
using EtBr that art specific to wWkype (A) or fickle £S> alleles of 
tbc fiinran p-globin gene. The left of each pairof tubes contains 
aBefc-mecffic primers to the wild-type aBeles. the nghi lube 
primers to the sickte atlek. The photograph ^.tafcen after SO 
cycles of PCRj and the input DNAs and tfce aUeJes ihev contom 
Vindicated- Fifty tog of DNA wasused to htm ^R Typmg 
was done in triplicate (3 pairs of PCfci) for each mpu* DNA: 



25 C 



20ng of mole DNA 




94°c 




I ■■ 1 1 

0 2000 4000 6000 8000 
time (sec) 

ItGOnS Continuous, r^^c^nitoxm^ofa P^^apUc 
was «cd to c^rry exdialion light to a PCR m F^^J$> 
ewwtted light back to a fluoromcter (sec Experimental potoco^. 
5^caloa^iog human calcDN Anmcn .naTO 
5tarSff with tO ng of human male DNA (top), or m x control 
PCR i&hout DNA (bottom), were monnorco. Thirtv cydes of 
PCR vert fcJiowed for each. Hie temperature cycled between 
94X (denaturation) and 50*C Cannealiug and extcns>c>nj. Notr m 
uV iiaae DNA PCR, the cyd« (time) depettfent moeasc m 
fluorescence at the anrteafing/extCDsion temperature. 
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DN A— up to microgram amount3~-in order to have suf- 
ficient number* of target sequences. This large amount of 
starting DNA m an amplication simibcantly increases 
the background fluorescence over which any additional 
fluorescence produced by PGR must be detected. An 
additional complication that ocean with targets in tew 
copy-number is the formation of the **prinaer-ctimer" 
artifact. This is the result of the extension of one primer 
using the other primer as a template. Although this occurs 
infrequently, once it occurs the extension product is a 
substrate for PGR amplification, and cart compete with 
true PCR targets if those targets are rare, flie primer- 
dimer product is of course dsDNA and thus is a potential 
source of false signal in this homogeneous a*$ay, 
To increase PGR. specificity and reduce the effect of 

Crirner-dimcT amplification, we are invesdgatirjg a rtum- 
er of approaches, including the use of nested-primer 
amplifications that take place in a sortie tube 8 , and the 
"hot-start", in which nonspecific amplification ift reduced 
by raising the temperature of the reaction before DNA 
synthesis begins 23 . Prdhninary results using these ap- 
proaches suggest tbatprhner-diinjCT is effectively reduced 
and it is possible to cfetect the increase in Etfir fluores- 
cence in a PCR instigated by a single HIV genome in a 
background of 10* celts. With larger number* of cells, the 
background fluorescence contributed by genomic DNA 
becomes problematic. To reduce this background, it may 
be possible to use sequertce-specinc DNA-binding dyes 
that can be made to pxcfcrenbaJly bind PCR product over 
genomic DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a 5' ^add-on" to 
the ougonvdeotide primer 2 ' 1 . 

We nave shown that the detection of fluorescence 
generated by an EtBr-containing PGR is straightforward, 
both once PGR is completed and continuously during 
ihermocyding. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The fluorescence analysis 
of completed PCRs is aheadyjpossiblc with existing instru- 
mentation in 96-well format**. In this format, the fluores- 
cence in each PCR can be guantitated before, after, and 
even at selected points during thermocyciing by moving 
the rack of PCRs to a 96-rmerowcM plate fluorescence 

reader 20 . . 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A direct extension of the apparatus used here is to have 
multiple fiberoptics transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number. Figure $ shows that 
the larger the amount of starting target DNA, the sooner 
during PCR a fluorescence increase is detected. Pretimi- 
nary experiments <Higuchi and DoUinger, manuscript in 
preparation) with cotrtinuous rnorirtoring have shown a 
sensitivity to two-fold differences in iniuaJ target DNA 
concentration. 

Conversely, if the number of target molecules is 
fc nown _a$ k can be in genetic screening-rcontinuous 
monitoring may provide a means pf detecting false posi- 
tive and false negative result*. With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR. 
Increases in fluorescence detected before or after that 
cycle would indicate potential artifacts. False negative 
resuks due to, for example,. inhibition of DNA polymer- 
ase, may be detected by including within each PCR an 
inefficiently ampRfying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
I cles — many more than arc necessary so detect a true 



positive. If a sample fails to have a fluorescence increase 
after this many cycles, inhibition may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event, before any test based on this 
principle is ready for the cUrdc, an assessment of its fake 
positive/false negative rates wfll need to be obtained using 
a large number of known samples. 

In summary*, tne inclusion m PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect spcci6c DNA amplification from outside 
::he PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
in applications that demand the high throughput of 
samples* 



EXPERIMENTAL PROTOCOt , . ^ 

Human HLA-DQ* geae *mpHBeati<m3 omtauurc Et&. 
PCRs were set up in 100 **l volumes confining 1 0 raM Tns-HCh 
pH 8.3; 50 mM KC1; 4 mM MgC^: « units of Too DNA 
polyittCMSC (Pterkm>£>ittcr Genu, Norwalk, CT); 20 pmote caeti 
of human HtA-DQa ' gene specific oligonucleotide primers 
(;H*6 and CH27 19 and approximately 10* copies of DQfc PCK 
product diluted from a previous Reaction. Ethidium bromide 
(E*Bn SigttvO was used at the concentrations indicated in Figure 
2. Thcrmocyding proceeded for 20 evdes in a mode! 480 
thcrmocydcr (PerUn-Eltter Ccw, Norwalk, CT) * "^cp; 
cycle" program of 94*C for 2 min, dcoaluraUon and wv» for w 
sec annea&ff and 72 °C for 30 sec. extension, 

Y-chrontoMisnc specific PCR* PCRs (100 pi total reacuon 
volume) ecmtahiing vJ& v&fiXil EtBr were prepared as described 
for HLA-DQa, except wiS* different primers and target DNAs. 
These PCRs contained J $ proolc each male DN A-spcctfic prime** 
YI.) and Yl.2 M , and cither 60 ng male, 60 og female, 2 ng raate, 
ot no human DNA. Triermocycling was W^Tor I min- and S(^C 
for 1 min using a "rtcp-cyde* program. The number of cycles for 
a sample were as indicated in flgui* 3. Fluorescence measure- 
ment is described below. ^ % . 

Allck-spcCTfic, human 0-globiii gw*e PCIL Amplificanons of 
100 »U volume' ntmg 0-5 M#ml of ^ T wcrc prcjf>ami 
described for HLA-DQ* above except with diff ercnt pruncr* anO 
target DNAs. These PCRs comaintd eiiber. pHmcr wr H(jW 
HB MA <w8oMype globin specific primers) or HCP2/Hpl*$ 
lc-giobin spcafic primers) at 10 pmole e*ch pnmcr pcrFCR. 
TStse primers were developed by Wu ct aL 2 \ Three different 
tatgei DNA a were tucd in separate amplification*— 50 ng each, of 
human DNA that was homozygous for (he sfcklc trait <$S), DNA 
that was heterorygow for the iteWe trak < A$>. or DNA that 
homozygous for the W.t. globm (AA). Thcrmocycfing was for » 
cycles at 94"C for 1 min. and 55>*C for 1 min. itsmg fl •'stcp-cycte 
pWram. An annearmg Wmpcnuure of 6B°C ^h*d been shown M 
Wo et al, aJ to provide alldc^pccific awplitouon- Completed 
PCR* were photographed through a red fito t &**^™V 
after pladh/thc reacfion lube* aSp a model TM^36 iranflffl.unfti 
natof <UV-productS San Gabriel, CA). 

Fhiore^evWemeasiiranetft. Flworeseeiwc rocasuieraenw wcr« 
mao> on PCRs containing EtBr in a Fluorolog-2 fluorotneter 
(SPEX. Edison. NJ). Excitation was at the 500 nm band « 
ihout 2 nm bandwidth with «; GO 43S nm cm^ff -fflwr jMcl^ 
Grist Inc., Irvine. CA) to exclude second-order light, tortteo 
y R ht was detected a< 570 nm with a bandwidth of about 7 nm. An 
Cfe 530 »m cut-off filter was used to remove the excitation 

ContftafcOttft ftMwesccnce inoiutormg of PCR* ConpniKm^ 
monitoring of a PCR in progress i was accompiwhed ^ tiiC 
spcctrofiuSromeicr aud Jetringa deserved above as J* » 
fiberoptic accessory (SPJ&X cat no. 1950) to ^th send cxowuob 
light to. and rccei^ emitted light from, a PCR placed m a wcU^ 
a model 480 thwmocydcr (Perk'm-Elmet vCetus). The probe : 
of the nbcroptic cable was attached with "5 mtoutc-cpoxy to jc 
open top of a PGR tube (a 0.5 ml poivpropytew: centnf^cwW 
with its cap removed) cfTeciweW scahog n. The exposed" top ^ 
the PGR tube anfi the end of the fiberoptic cable were shjc] ded 
from room bght and the rooto lights were kept duttmed dutmg 
each run. The monitored FOR was an amplificauon of V-«*^ 
fXKWome-spcdfic repeat sequences as d^bed above, cxcc|n 
usingan anncabWextension lemperauirc of 50°C. Tne r<acw» 
was covered witii minerd oil (2 drop*) to proem ^aporanon- 
TrKrroocycUng and fiuorcsccncc rocasurcmcQt were ^ur^eo *» 
multancously- A time-base son with a 10 second mtegranort can* 
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ua* u«5d and the embuoa signal was ratioed to tbc excitation 
fliffOA) u> control for change* in Ji$ht-»ourcc intensity. Data were 
fleeted using the draSOOOf, version 2.5 (SPEX) data system. 

We thAiut Bob Jones for help with the spectrofluormctnc 
ff«vuti rci»ci*i.ts and HcaiherbeU Fongfor editing this manuscript. 
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IMMUNO BIOLOGICAL LABORATORIES 



SCD-14EUSA 

Trauma, Shock and Sepsis 




The CD- 14 molecule is' expressed on the surface of 
monocytes and some macrophages. Membrane- 
bound CD-14 is a receptor for lipopolysaccharide 
(LPS) completed to LPS-8inding-Protein (IBP), "me 
concentration of Jts soluble form is aflered under 
certain pathological conditions. There- is evidence for 
an important rale of $CD-14.with polytrauma, sepsis, 
burnings and inflammations. 
During septic conditions and acute infections it seems 
to be a prognostic marker and is therefore of value in 
monitoring these patients. 



1BL offers an ELISA for quantitative determination of 
soluble CD-14 in human serum, -piasma, ceikajtture 
supernatants and other biological fluids. 
Assay features: 12x8 determinations 
(microliter strips), 
precoated with a specific 
monoclonal antibody, 
2x1 hour incubation, 
standard range: 3-96 ng/mJ 
detection limit: 1 ng/ml 
CV: intra- and interassay < 8% 

For more information caH or fax 



GESELLSCHAFT FUR IMMUNCHEMIE UND -BIOLOGIC MBH 

0STERSTRASSE 86 - D - 2000 HAMBURG 20 - GERMANY - TEL. +40/491 00 61-64 ■ FAX + 40 /40 11 98 ,jj 
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.1MULTANE0US AMPLIFICATION AND DETECT10H C 
SPECIFIC DMA SEQUENCES 

Himchi* Gavin. Dolliager 1 , P. Scan. Walsh and Robert Griffith 

^4f,0g, * Corresponding author. ^ _______ 




We have enhanced the polymerase chain 
taction (PCJR) such that specific 1>NA 
sequences can be detected without open- 
ing the reaction tube. This enhancement 
requires the addition of e_-idium bromide 
(EtBr) to a PCR, Since the fluorescence of 
EtBr increases in the presence of double- 
stranded (ds) DNA an jtaerease in fl-ores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni- 
tored externally. In fact, amplification can 
bc continuously monitored in order to 
foUow its progress. The ability to simulta- 
neously amplify specific DNA sequences 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiring high sample through- 
put 

lthough the potential benefits of PCR 1 to clio- 
^&gno4t£ arc «ctt known", it U stifl not 
widely used in this jetting, even though tt » 
A _J__ fmir years aoaa thciroikwtaW* DNA poty-jer- 
m«* made PCR practical. Son* of the reasons fot it* slow, 
mxeptam* are high cost, tack of automation of P^- and 
post-FCR proofing steps and false Y>*^"^™L 
arryovCT-cOntaminadoD. The first two points arc related 
in that labor is the largest contributor to cost at the present 
stage of PCR dcvrf4««^ Most Current assays injure 
•o£c form of "downstream" processing once *««°cy. 
ding « done in order to determine whether the target 
DMA sequence was present a«d h»»*.Jh« 
include DNA hybridisation'-*, gel getfropbo-Mi wtth ^or 
without useof rUictiondigesdon^HPt^, or «pdtary 
deorophorcsi* 19 . These meOiods *- J^J 
low ihrotiehpuu and are difficult to autoniate. The third 
point is abo closely related to downstream processing. 
The handling of the PCR product in ^ d 0 ^^?, 
processes increases the chances that amplified DNA will 
spread through die typing lab, resulting m a itsk of 



carryover'' false positives in subsequent testing . 
These downstream processing steps would be elimi- 
nated if specific amplification and detection of amplified 
DNA. took place simultaneously within an unopened re- 
action vessel Assays in. which such different processes take 
place without the uccd to separate reaction components 
have been termed Imogen-Otis'. No mdy homoge- 
neous PCR assay has been demonstrated to dat^aUhough 
progress towards this end has been reported. Chehab, « 
jj.ii? developed a PCR. product detection scheme u«u.g 
fluorescent primers that resulted in a fiuor^«.t PCR 
product Allck-specific primers, each with different Huo- 
?_Snrtags, wcrTusedto indicate the gcnotyp^ ofthe 
DNA. However, the unincorporated primers uiust stul « 
removed iaadownsrream procesi ^^^f^ 
result Recently, Holland, et developed aaassayin 
iwhich the endogenous 5' c*onudease assay of Taq DNA 
Unerase w^tploitcd to cleave a bbe^gonudeo- 
Sde probe. The probe would only ck*ve if PCR »p»- 
cation had produced its conrpjementary 
order to detect the cfcavage products, however, a suose- 

and PCR product detecdon based upon the 
cfelsed fluorescence that ethidium btoinide^ and other 
DNA binding dyes exhibit when they are bound to_ds- 
DNA 1 " . As outbned in Figure 1, a proiotypK PCR 
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nCVBE 1 -fVindplc of simultancoua amplification and^dcl^aion of 
PCR product. The co_npoatmtiofa ^Rcotawnh^ Ei»r^at sro 

daDNA, Thctt is a large fewrcscwcc cnhaaconciit » 
bound to DNA awl tSn_lih£M gi^rty chance* j^tjn DNAjS 
doubte-uxanded. Ato su&dcnt <n) . ijcte of ^ PGR. the net 
increase in dsDNA results in additioDai EtBr bmdtag. and » net 
increase in total flaoTEScencE: 
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MV&fi 2 Get elcctropltoresis of PCR amplification products of chc 
human, mtdcar gene, HLA DQa, made in the presence of 
increasing amounts of EtBr (up to 8 p-g/ml). The presence of 
fctijr iws "no obvious effect on the yield or specificity of amplifi- 
cation. 



A. 




B. 




nOOK $ (A) Fluorescence measurements, from PCRs that contain 
0.5 p-g/nJ EtBr and that are specific for Y<htf>tno*>!ioe repeat 
sequence*. Frvc replicate PCRs were begun containing catch of the 
DNAs specified. At each mdicstcd cycle, one of the five replicate 
PCRs for each DNA *as icro o vcd from thcnnocydrrig and tts 
fluorescence measured. Unit* of fluoxww:cnee art arbitraiT. {£) 
UV photography of PCRtubef (0.5 ml £ppcndorf «tylcv potypro- 
pylcne mtcro-centrifugc tubes) contammg reactions, those start- 
ing from 2 ng male DNA And control reactions without any DNA, 
from (A). 



begins with primers that are single-stranded DNA (ss- 
DNA)» dNTPs, and DNA polymerase; An amount of 
dsDNA containing the target sequence (target DNA) is 
also typically present. This amount can vary, depending 
on the application, from suigrle-ceU amount* of DNA 17 to 
micrograms per PCR- 8 , If EtBr is present, the reagents 
that will fluoresce, in order of increasing fluorescence, are 
frxe EtBr itself, and EtBr bound to the stngk-stranded 
DNA primers ami to the doublc^tranded target DNA (by 
its intercalation between the stacked bases of the DNA 
doubk>hcfix). After the first denaturation cycle, target 
DNA will be largely single-stranded. After a PGR is 
completed, the most significant change is the increase in 
the amount of dsDNA (the PGR product itself) of up to 
several mkaxtgrarns- Formerly free EtBr is bound to the 
additional dsDNA* resulting in an increase in fluprcs- 
ccncc> There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to ssDNA 
ts much Jess than to dsDNA, the effect of thw change on 
the total fluorescence of the sample is small. The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 
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bcibrc and alter, or even continuously during, thennocy. 
ding. 

RESULTS 

PGR in the presence of EtBr. In order to assess the j 
affect of EtBr in PGR, amplifications of the human HiA 
DQet gene >9 were performed with the dye present at 
concentrations from 0,06 to 8,0 ugfml (a Lyntca} concea- 
tration of EtBr used tn staining of nucfek: acids following 
get electrophoresis is 0.5 u.g/m(). As shown in Figure 2, gel : 
electrophoresis revealed little or no difference in the yield 
or quality of the amplification product whether EtBr was 
absent or present at any of these concenUarJorj$, indicat- 
ing that EtBr does not inhibit PGR, 

Detection of human Y-chromo«Hn«s specific se* 
arrestees* Sequence-specific, fluorescence enhancement of 
EtBr as a result of PGR was demonstrated in a scries of 
amplifications containing 0,5 jtg/m| EtBr and primers 
specific to repeat DNA sequences found on the humaa 
Y-chromosomc 26 . These PCRs initially contained either 
60 ng male, 60 ng female, 2 ng roak human or no DNA. 
Five replicate PCRs were begun for each DNA, After 0, 
17, 21 , 24 and 29 cycles of thenuocycltng, a PGR for each 
DNA was removed from the thermocyclen and its, fluo- 
rescence measured in a spectroflnorometer and plotted 
vs. amplification cycle number (Fig. 3 A). The shape of this 
curve reflects the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA is 
becoming linear and not exponential with cycle number. 
As shown f the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain- 
ing human male DNA, but did not sigriificandy increase 
for negative control PCRs, which contained cither no 
DNA or human female DNA. The more male DNA 
present to begin with— 60 ng versus 2 ng— the fewer 
cycles were needed to give a detectable increase in fluo- 
rescence. Gel eieewphoresi* on the products of these 
amplifications showed that DNA fragments of the ex- 
pected size were made in the male DNA containing 
reactions and that Me DN A synthesis took place in the 
control samples. 

In addition, the increase in. fluorescence was visualized 
by simply laying the completed, unopened PCRs on a UV 
trarisilhirninatOT and photographing them through a red 
filter. This is shown in figure SB tor the reactions that 
began with 2 ng male DNA and those with no DNA* 

Detection of specific allele* of the human $-globin 
gene. In order to demonstrate that this approach has 
adwruatc spedficity to allow genetic screening, a dttcction 
of trie $fcldc-ceu anemia mutation was performed. Figure 
4 shows the nuoresccn.ee from completed amplification* 

containing EtBr (0.5 titfad) a* detect** by photography 

of the reaction tubes on a UV ttan^uminator. These 
reactions were performed using- primcrR specific for ci- 
ther the wild-type or sickle-ceil mutation of the human 
p-globin gene* 1 . The specifkity for each allele is imparted 
by placing the sickle-mutation site at the tcnninaJ 3 
nucleotide of one primer. By using an appropriate primer 
annealing temperature, primer extension— and thus am- 
ptifkatjem-^can take place only if the 3' nucleotide of the 
primer is conip.Ierncntary to the p-gJobui allele present . 

Each pair of arnrJfficauons shown in Figure 4 consists ol 
a reaction with either the wiM-typc allele specific (left 
tube) or stcklc-allele specific (right tube) primers. Three 
different DN As were typed: DNA from a homozygous, 
wiid-typc p-gjobin individual (A A); from a heterozygous 
sickle p-globin individual (AS); and from a homozygous 
sickle p-$ofcn individual (SS). Each DNA (50 ng genomx 
DNA to start each PGR) was analyzed m triplicate (3 patrs 
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nf pactions each). The DNA .type was reflected m the 
Irktive fluoreseetice intensities in each pair of completed 
^lificMkMi*. There was a significant increase in fluores. 
SSc only where a ^globin aQele DNA matched the 
orimcT set. When measured on a spcctroflnororactcr 
Main not shown), this fluorescence was about three times 
Sl t present in a PGR where both p-globin alkies were 
mfamatched to the primer set. Get ckarophofes* (not 
Shown) established that this increase in fluorescence was 
H«e to the synthesis of nearly a microgram of a DNA 
foment of the expected size for ^^J^ 1 *™ 
ittte synthesis of dsDNA in reactions m which the aflele- 
-nedfic primer was mismatched to both alleles. 
* Coorimiowj m<wwtoriog of a PGR. Using a fiber opuc 
devkcrit is possible to direct excitation nomination from 
i specuofluorometer to a PGR undergoing thcrmocychng 
and to return its fluorescence to the iqpectrcRuoromcter. 
The fluorescence readout of such an anrangeraent, di 
rcctcd at an EtBr-concaining amplificanon of Y-chrojmo- 
some specific scgvcikcs from 25 ng of human male DN£ 
IT shown in Figure S. The readout from a control FCR 
with no target DNA is also shown. Thirty cycles of FCR 
w «rc monitored for each- m 

The fluorescence trace as a function of time dearly 
shows the efFect of the thermocyciing, Fluorescence inten- 
sity rises and. Calls inversely with temperature The fluo- 
rescence intensity is minimum at the denaturauon tenv 
ocrature (&4°C) and raaxiriiuin at the anneal^g^xlenston 
temperature (50°C). In the negative-control FCR, these 
fluorescence maxima and minima do not change signro- 
csintly over the thirty tbcrxnocycte, indicating that there is 
Kite dsDNA synthesis without the appropriate target 
DNA, and there is Utde if any bieacrungof EtBr during 
the continuous Ulunrinattan of the sample. 

In the PGR containing male DNA, the fluorescence 
maxima at the annealing/extension temperature begm to 
increase at about 4000 second* of theripocycl^g, and 
continue to increase with rime, indicating that dsDNA is 
being produced at a detectable JeveL Note that the fluo- 
rescence minima at the denatuxation temperature, do not 
fligtuficandy increase presumably * ^^P^I 

aturc there is no dsDNA fof EtBr to bind. Thus the course 
of the amplification is followed by tracking the fluores^ 
cence increase at the annealing temperature. Analysis of 
the products of these two amplifications by ^d**?^ 
rois showed a DNA fragment of the ejepe^ ^ for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sampte. 

DISCUSSION 

Downstream processes such as r^ridi7^on to a se- 
Quencenfipedfic probe can enhance die specificity of DNA 
dcteuii v .rb> FCR. The chmioatkm of ^ procewe? 
means that' the specificity of this homogeneous assay 
depends solely on ihat of tCSL In the case of ?*Wf^*» 
dScaac, we have shown that PGR alone has sufficient DNA 
sequence specificity to permit geiicdc screening. Using 
ap^ropriaie amplification conditions there is hnte nw- 
specific production of dsDNA in the absence of the 
appropriate target allele. 

The sp<windty required to detect pathogens can be 
more or less than that required to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with the 
sample. A difficult target is HIV, which retires detection 
of a viraJ genome that can be-at the level of a few copies 
per thousands of host cells 6 . Compared with genetic 
screening, which is performed on ceils containing at feast 
one copy of die target Boquence. IttV tfetecuon requires 
both more specificity and the input of more total 
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washed to carry- excitation light to a ^^P^^^ 
emntfid Ught back to a fluorotneter (see H^rnc^I^ocol). 
Amnfficalon Wing human nudo-DNA spedfic pnmcnt in a PCR 
Si^ ^ n| of human male 

^thowt DNA (bottom), were, monnorcd, Thinvcyda of 
PCR w^folWcd for each. The tempcratinx cycled betweet* 
9^C ^nVradon) and 50*C CaniHalin? and extonsjon). Note m 
Sc PNA PCi^.the cycle ^> f ^^ GreMC W 
ftuor«ccnce at the anneaEng/extenaioB temperature 
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DNA up to microgram amoun&~"in order to have suf- 
ficient numbers of target sequences. This large amount of 
starting DMA rti an amplification si&M&tantly whereases 
the bacl^rround fluorescence over which any additional 
fluorescence produced by PGR must be detected. An 
additional complication that occur* wkh targets "m low 
copy-number is the formation of die "primer-dimer" 
artifact. This is the result of the extension of one primer 
using the other primer a* a template. Although this occurs 
infrequently, Once it occurs the extension product is a 
substrate for PCR amplification, and can compete whh 
true PCR targets if those targets are rare. The primer- 
dimcr product it of course dsDNA and thus is a potential 
source of false signal in this homogeneous a*$ay. 

To increase FCR specificity ana reduce the eScet of 
primer-dimcr airtplificarion, vre are investigating a num- 
ber of approaches, including the use of nested-primer 
amplifications that take place in a single tube*, and the 
"hot-start", in which nonspecific amplification is reduced 
by raising the temperature of the reaction before DNA 
synthesis begins 25 . Prdhninary results using these ap- 
proaches suggest tbatprimer-dirocr is effectively reduced 
and it is possible to detect the increase in EtBr fluores- 
cence in a PCR instigated by a single HIV genome in a 
background of ID* ecdU. With larger numbers of ccHs, the 
background fluorescence contributed by genomic DNA 
becomes problematic. To reduce this background, it may 
be possible to use sequence -specific DNA-binding dyes 
that can be made to preferentially bind PCR product over 
genomk DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a 5' "add-on" to 
the oligonucleotide primer**. 

We have shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward, 
both once PGR is completed and continuously during 
thermocyefing. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The ttuorescence analysis 
of completed PCRs is already possible with existing instru- 
mentation in 9G-we!l formar *. In this format, the fluores- 
cence in each PCR can be quantitated before, after, and 
even at selected points during therraocyciing by moving 
the rack of PCRs to a ^microwcH plate fluorescence 
reader 20 . 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A direct extension of the apparatus used here is to have 
multiple fiberopdes transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number. Figure S shows that 
the larger the amount of starting target DNA, the sooner 
during Pf.R a fluorescence increase is detected. Prelimi- 
nary experiment* <Wiguchi and DolHnger, manuscript in 
preparation) with continuous monitoring have shown a 
sensitivity to two-fold differences in initial target DNA 
concentration. 

Conversely* if the number of target molecules is 
Vnown — as it can be in genetic screening-rcontinuous 
monitoring may provide a means of detecting fahc posi- 
tive and false negative results With a known number of 
target inolecuks, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR* 
Increases in fluorescence detected before or after, that 
cycle would indicate potential artifacts* False negative 
resuks due to, for example*. inhibition of DNA polymer- 
ase, may be detected by including within each PCR an 
inefficiently amplifying markcT. This marker results in a 
fluorescence increase only after a large number of cy- 
cles—many more than arc necessary to detect a true 
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positive. If a sampJc fails to have a fluorescence increase 
alter this many cycles, inhibition may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event before any test based on this 
principle is ready for the chriicy an assessment of it* false 
positive/false negadve rates wDl need to be obtained using 
a large number of known samples. 

In summary, the inclusion ba PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect speci&c DNA amplification from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
n applications that demand the high throughput of 
samples* 

EXPERIMENTAL PROTOCOL t . ^ 

Hamas HLA-DQn gtttt *mpKBcat*o»s containing EtBr. 
PCRs were set up in 100 **l volumes confining 10 mM Tris^HCK 
pH 8.3; 50 mM KC1; 4 mM MgO z : i£ units of Toe DNA 
polymers (FterVm*E}mcr Genu. Norwalk CT); 20 pinole each 
of human HtA-DQa gene specific oligonucleotide primers 
(JH26 and CH27 19 and approximately 10* copies of DQ& PCR 
product diluted from a previous reaction. Ethidium bromide 
I EiBr; Sigtwv was used at the concentrations indicated in Figure 
2. Thermocyding proceeded for 20 cvcles in a mode! 480 
Ihcrmocydcr (Pexkin-EJ flier Ceu», Norwalk, CT) uaj ng a "step- 
cycle" program of 94*C for 2 mm-dciiaturauon afKt otrr» W 3W 
sec. annealing and 72°C for 30 see. extension, 

Y^cbroinovomc specific PCR* PCRs (JO0 ul total rcacuon 
volume) containing 1U» **e/m* ^tBr were prepared as described 
for HLA-DQ&, except with different primers and target DNAs. 
These PCRs contained J 3 pmolc each male DNA-spccific primer* 
YI.] and Vl.2 w P and either 60 ng male, 60 eg female, 2 ng mak. 
or no human DNA. Thermocycling n**94*C?or 1 min- and WX: 
for 1 min using a "stcp-cyde* program. The number of cycles for 
a sample were as indicated in tfgui* 3. rluorcsccnce measuro 

me*tt is described below. t 

AUck^spccmc. human £-gloMn. giro* PCR, Amphncauons of 
100 v-\ volume' «*m& 0 5 of £tBr were prejfttrcd a; 

described for HLA-DQ* above except with different primers and 
target DNA&. These PCRs contained either P^mer pair -H^nf 




Utgei DNA* were ti*cd in separate amplifications?— &0 ng eaca or 
human DNA that was hoinoxygous for the JtfcVte trait (5S)* DNA 
that was heterozygous for the sickle traH (AS), or DNA that vrtu 
homozygous for the w.i- globia (AA). ThcrmocycEng was ffir 30 
cycles at for 1 min. and 55*C for 1 min. using a "step^yefc 
pUram. An armealmg temperature of 6S°C h*<X heen shown to 
Wu et al. 2J to provide alldc-spccinc awpliftcaUon. Omtpjcted 
PCRs were photographed through a red fitter /WnLttcrr23A) 
after placing' the rMcSon t«b<* f*iop a model TM-36 transIH.umi- 
nator <W V-products Sah Oabricl, CA). 

Fhioreseenee measurement. Fluorescence mcasurcracn w rwet* 
madd on PCRs containing EtBr in a Fluorolog-2 fluororncurr 
(SPEX, Edison. NJ). Excitation was at the 500 nm i>and v jth 
ihourS nm band4lth with *GQW nrn 
Crist Inc.. Irvine. CA) to exdudc scW^rder ^ 
li K hl was detected at 570 nm with a bandwidth of about 7 nm. An 
OG 530 om cut-off fitter was used to remove the exotauon 

CondtHtouft ffaK>re5)cence xamitormg of FCR, Co»>tinuOu* 
monitoring at a PCR in progress ^ accomplished usin^ m c 
spcetroftuSrometer and settings descrrbed Above as well as a 
fiWoptk accessory (SPEX at. na 1950) bo^ad 
light to. and rccei^ emitted Ught from, a PCR pW in a weUj « 
a model 480 inwwxyclcr (PerVm-Elmer Cetws). The probe e*a 
of the nbcroptic cable was attached vrth "5 mro utc-cpoxy to t»o 
open top of a PCR tube (a 0.5 ml pc4ypropyienc centrifuge tuoe 
with its *p removed) effectively scaling ic The c*po«^tc£tf 
the PCR tube ana the end of the fiberopuc caWe were shjewea 
from room light and the room lights were kept dmtmed durmg 
each run- The monitored PCR was an amplication of V-chic- 
riKwdrm-spcdnc repeat sequences as described above, c^ccF 
u*hw an anneahng^extension wmperauirc of 50°C. Tnc Teacow" 
was covered ivith 9 mu>erJil oil (2 drope) to prwnt evaporation- 
TbermocycUnff and fluorescence rocasurcment vere started 
multancously, A tirn^basc scan with a 10 second mtegranOi> tw«c 
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km uwJd *nd the emboli signal was ratioed to the excitation 
•aerial to control Ibc chawfte* b Jight-»ourcc intemtity. Data were 
eSlcc^d using the droSOOOr, version 15 (SPEX) data system. 
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IMMUNO BIOLOGICAL LABORATORIES 



SCD-14EUSA 

Trauma, Shock and Sepsis 




The CD-14 molecule is' expressed on the surface of 
monocytes and some macrophages. Membrane- 
bound CD-U is a receptor tor lipopolysaccharide 
(LPS) complexed to LPS-8inding-Protein (LBP). The 
concentration of tts soluble form is aftered under 
certain pathological conditions. There- is evidence for 
an important role of $CD-14.with pdytrauma, sepsis, 
burnings and inflammations. 
During septic conditions and acute infections it seems 
to be a prognostic marker and is therefore of value in 
monitoring these patients. 



1BL offers an ELISA for quantitative determination of 

soluble CD-14 in human serum, -plasma, cell-culture 

supematants and other biological fluids. 

Assay features: 12x8 determinations 
(microliter strips), 
precoated with a specific 
monoclonal antibody, 
2x1 hour incubation, 
standard range: 3-96 ng/rm 
detection limit: 1 ng/ml 
CV: intra- and interassay < 8% 



For more information caH or fax 
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Oligonucleotides with Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kenneth J. Livak, Susan J.A. Flood, Jeffrey Marmaro, William Giusti, and Karin Deetz 

PeTkin-Elmer, Applied Biosystems Division, Foster City, California 94404 



The 5' nuclease PCR assay detects the 
accumulation of specific PCR product 
by hybridization and cleavage of a 
double-labeled fluorogenlc probe 
during the amplification reaction. 
The probe Is an oligonucleotide with 
both a reporter fluorescent dye and a 
quencher dye attached. An Increase 
in reporter fluorescence intensity In- 
dicates that the probe has hybridized 
to the target PCR product and has 
been cleaved by the 5' -*3' nucle- 
olytlc activity of Taq DNA polymerase. 
In this study, probes with the 
quencher dye attached to an internal 
nucleotide were compared with 
probes with the quencher dye at- 
tached to the 3 '-end nucleotide. In all 
cases, the reporter dye was attached 
to the 5' end. All intact probes 
showed quenching of the reporter 
fluorescence. In general, probes with 
the quencher dye attached to the 3'- 
end nucleotide exhibited a larger sig- 
nal In the 5' nuclease PCR assay than 
the Internally labeled probes. It Is 
proposed that the larger signal Is 
caused by increased likelihood of 
cleavage by Taq DNA polymerase 
when the probe Is hybridized to a 
template strand during PCR. Probes 
with the quencher dye attached to 
the 3 '-end nucleotide also exhibited 
an Increase In reporter fluorescence 
Intensity when hybridized to a com- 
plementary strand. Thus, oligonucle- 
otides with reporter and quencher 
dyes attached at opposite ends can 
be used as homogeneous hybridiza- 
tion probes. 



#^ homogeneous assay for detecting 
the accumulation of specific PCR prod- 
uct that uses a double-labeled fluoro- 
genic probe was described by Lee et al. (1) 
The assay exploits the 5' -» 3' nucle- 
olytic activity of Taq DNA poly- 
merase^ and is diagramed in Figure 1. 
The fluorogenlc probe consists of an oli- 
gonucleotide with a reporter fluorescent 
dye, such as a fluorescein, attached to 
the 5' end; and a quencher dye, such as a 
rhodamine, attached internally. When 
the fluorescein is excited by irradiation, 
its fluorescent emission will be 
quenched if the rhodamine is close 
enough to be excited through the pro- 
cess of fluorescence energy transfer 
(FET). (4 ' S) During PCR, if the probe is hy- 
bridized to a template strand, Taq DNA 
polymerase will cleave the probe be- 
cause of its inherent 5' -> 3' nudeotytic 
activity. If the cleavage occurs between 
the fluorescein and rhodamine dyes, it 
causes an Increase in fluorescein fluores- 
cence intensity because the fluorescein 
is no longer quenched. The increase in 
fluorescein fluorescence intensity indi- 
cates that the probe-specific PCR product 
has been generated. Thus, FET between a 
reporter dye and a quencher dye is criti- 
cal to the performance of the probe in 
the 5' nuclease PCR assay. 

Quenching is completely dependent 
on the physical proximity of the two 
dyes. (6) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5' end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3 1 end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



PCR assay. Furthermore, cleavage of this 
type of probe is not required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3' end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 



MATERIALS AND METHODS 
Oligonucleotides 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used in this 
study. Linker arm nucleotide (LAN) 
phosphoramidite was obtained . from 
Glen Research. The standard DNA phos- 
phoramidites, 6-carboxyfluorescein (6- 
FAM) phosphoramidite, 6-carboxytet- 
ramethylrhodamine succinimidyl ester 
(TAMRA NHS ester), and Phosphalink 
for attaching a 3'-blocking phosphate, 
were obtained from Perkin-Elmer, Ap- 
plied Biosystems Division. Oligonucle- 
otide synthesis was performed using an 
ABI model 394 DNA synthesizer (Applied 
Biosystems). Primer and complement 
oligonucleotides were purified using 
Oligo Purification Cartridges (Applied 
Biosystems). Double-labeled probes were 
synthesized with 6-FAM-labeIed phos- 
phoramidite at the 5' end, LAN replacing 
one of the T's in the sequence, and Phos- 
phalink at the 3' end. Following de- 
protection and ethanol precipitation, 
TAMRA NHS ester was coupled to the 
LAN-containing oligonucleotide in 250 
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Reverse 
Primer 



5'. 

3*- 



-3' 



Cleavage 




Polymerization completed 




5*. 
3'- 



-3' 
■5' 



mM Na-bicarbonate buffer (pH 9.0) at 
room temperature. Unreacted dye was 
removed by passage over a PD-10 Sepha- 
dex column. Finally, the double-labeled 
probe was purified by preparative high- 
performance liquid chromatography 
(HPLC) using an Aquapore C 8 220x 4.6- 
mm column with 7-jun particle size. The 
column was developed with a 24-min 
linear gradient of 8-20% acetonitrile in 
0.1 m TEAA (triethylamine acetate). 
Probes are named by designating the se- 
quence from Table 1 and the position of 
the LAN-TAMRA moiety. For example, 
probe Al-7 has sequence Al with LAN- 
TAMRA at nucleotide position 7 from the 
5' end. 



PCR Systems 

All PCR amplifications were performed 
In the Perkin-Elmer GeneAmp PCR Sys- 
tem 9600 using 50-^1 reactions that con- 
tained 10 mM Tris-HCl (pH 8.3), 50 mM 
KC1, 200 mm dATP, 200 |im dCTP, 200 jxM 
dGTP, 400 \iM dUTP, 0.5 unit of AmpEr- 
ase uracil N-glycosylase (Perkin-Elmer), 
and 1.25 unit of AmpliTaq DNA poly- 
merase (Perkin-Elmer). A 295-bp seg- 
ment from exon 3 of the human p-actm 



gene (nucleotides 2141-2435 in the se- 
quence of Nakajlma-li|ima et aL) (7) was 
amplified using primers AFP and ARP 
(Table 1), which are modified slightly 
from those of du Breuil et al. (8> Actin am- 
plification reactions contained 4 mM 
MgCl 2 , 20 ng of human genomic DNA, 
50 nM Al or A3 probe, and 300 nM each 



primer. The thermal regimen was 50°C 
(2 min), 95°C (10 min), 40 cycles of 95°C 
(20 sec), 60°C (1 min), and hold at 72°C. 
A 515-bp segment was amplified from a 
plasmid that consists of a segment of A 
DNA (nucleotides 32,220-32,747) in- 
serted in the Smal site of vector pUC119. 
These reactions contained 3.5 mM 
MgCl* 1 ng of plasmid DNA, 50 nM P2 or 
P5 probe, 200 nM primer F119, and 200 
nM primer R119. The thermal regimen 
was 50°C (2 min), 95°C (10 min), 25 cy- 
cles of 95*C (20 sec), STC (I min), and 
hold at 72°C 



Fluorescence Detection 

For each amplification reaction, a 
aliquot of a sample was transferred to an 
individual well of a white, 96-well micro- 
titer plate (Perkin-Elmer). Fluorescence 
was measured on the Perkin-Elmer Taq- 
Man LS-50B System, which consists of a 
luminescence spectrometer with plate 
reader assembly, a 485-nm excitation fil- 
ter, and a S15-nm emission filter. Excita- 
tion was at 488 nm using a 5-nm slit 
width. Emission was measured at 518 
nm for 6-FAM (the reporter or R value) 
and 582 nm for TAMRA (the quencher or 
Q. value)' using a 10-nm slit width. To 
determine the increase in reporter emis- 
sion that is caused by cleavage of the 
probe during PCR, three normalizations 
are applied to the raw emission data. 
First, emission intensity of a buffer blank 
is subtracted for each wavelength. Sec- 
ond, emission intensity of the reporter is 



TABLE 1 Sequences of Oligonucleotides 



F119 
R119 
P2 
P2C 
P5 
PSC 
AFP 
ARP 
Al 
A1C 
A3 
A3C 



primer 
primer 
probe 

complement 
probe 

complement 
primer 
primer 
probe 

complement 
probe 

complement 



ACCCACAGGAACTGATCACCACTC 
ATGTCGCGTTCCGGCrGACGTTGTGC 
TCGCATTACTGATCGTrGCCAACCAGlp 
GTACTGGTTGGCAACGATCAGTAATGCGATG 

CGGA'ITTGCTGGTATCTATG ACAAG G AJjp 
TTCATCCTTGTCATAGATACCAGCAAATCCG 

m^^^a^a rrnrrrr ATfTACGA 



AGACGCAGGATGGCA I OWii^^ 
CGCCCrGGACTTCGAGCAAGAGATp 
CCATCTCTTGCTCGAAGTCCAGGGCGAC 



no*-* r— ; . 

ta each oligonucleotide used in this study the 

5' 3' direction. There are three types of oligonucleotides: PGR pnmer, uuo ™»*" J* - the 
n i please assay, and complement used to ^^^^^^ 
probes the underlined base indicates a position where LAN with TAMRA attacnea 
tuted for a T. (p) The presence of a 3' phosphate on each probe. 
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Probe 



>temp. 



A1-2 

A1-7 

A1-14 

A1-19 

A1-22 

A 1-26 



518 nm 

+ temp\ 



raqgcxctcccccatgccatcctqcgtp 
ratgcccqcccccatgccatcctgcgtp 
ratxxcctcccccaqgccatcctgcgtp 
RatgccctcccccatgccaQcctgcgtp 
ratgccctc(xccatgccatc?cqgcgtp 
RatgccctcccccatgccatcctgcgQp 



582 nm 
no temp. +temp. 



RO- 



TO* 



ARQ 



A1-2 25.5+.2.1 32.711.9 38.2 + 3.0 38.2±2.0 Q.67 + 0.01 O.B6± 0.06 0.19 ±0.06 

A1-7 53.51 4.3 395.1121.4 108.516.3 110.315.3 0.4910.03 3.5810.17 3.0910.18 

A1-14 127.014.9 403.5+19.1 10B.715.3 93.116.3 1.1610.02 4.3410.15 3.18±0.15 

A1-19 187.5117.9 422.7 1 7.7 70.3 1 7.4 73.0 1 2.8 2.67 1 0.05 5.80 1 0.15 3.13 + 0.16 

A1-22 224.6 1 9.4 482.2 1 43.6 100.0 1 4.0 96.2 + 9.6 2.25 1 0.03 5.02 1 0.11 2.77 1 0.12 

A1-26 160.2 1 8.9 454.1118.4 93.115.4 90.7 ± 3.2 1.72 1 0.02 5.0110.08 3.29 1 0.08 

FIGURE 2 Results of 5' nuclease assay comparing p-actin probes with TAMRA at different nucle- 
otide positions. As described in Materials and Methods, PCR amplifications containing the in- 
dicated probes were performed, and the fluorescence emission was measured at 518 and 582 nm. 
Reported values are the average±l s.D. for six reactions run without added template (no temp.) 
and six reactions run with template (+temp.). The RQ ratio was calculated for each individual 
reaction and averaged to give the reported RQ" and RQ + values. 



divided by the emission intensity of the 
quencher to give an RQ ratio for each 
reaction tube. This normalizes for well- 
to-weil variations in probe concentra- 
tion and fluorescence measurement. Fi- 
nally, ARQ is calculated by subtracting 
the RQ value of the no-template control 
(RQ") from the RQ value for the com- 
plete reaction including template 
(RQ*). 

RESULTS 

A series of probes with increasing dis- 
tances between the fluorescein reporter 
and rhodamine quencher were tested to 
investigate the minimum and maximum 
spacing that would give an acceptable 
performance in the 5' nuclease PCR as- 
say. These probes hybridize to a target 



sequence in the human p-actin gene. 
Figure 2 shows the results of an experi- 
ment in which these probes were in- 
cluded in PCR that amplified a segment 
of the p-actin gene containing the target 
sequence. Performance in the 5' nu- 
clease PCR assay is monitored by the 
magnitude of ARQ, which is a measure 
of the increase in reporter fluorescence 
caused by PCR amplification of the 
probe target. Probe Al-2 has a ARQ value 
that Is close to zero, indicating that the 
probe was not cleaved appreciably dur- 
ing the amplification reaction. This sug- 
gests that with the quencher dye on the 
second nucleotide from the 5' end, there 
is insufficient room for Taq polymerase 
to cleave efficiently between the reporter 
and quencher. The other five probes ex- 
hibited comparable ARQ values that are 



clearly different from zero. Thus, ail five 
probes are being cleaved during PCR am- 
plification resulting in a similar increase 
in reporter fluorescence. It should be 
noted that complete digestion of a probe 
produces a much larger increase in re- 
porter fluorescence than that observed 
in Figure 2 (data not shown). Thus, even 
in reactions where amplification occurs, 
the majority of probe molecules remain 
uncleaved. It is mainly for this reason 
that the fluorescence intensity of the 
quencher dye TAMRA changes little with 
amplification of the target. This is what 
allows us to use the 582-nm fluorescence 
reading as a normalization factor. 

The magnitude of RQ" depends 
mainly on the quenching efficiency in- 
herent in the specific structure of the 
probe and the purity of the oligonucle- 
otide. Thus, the larger RQ" values indi- 
cate that probes AM4, Al-19, Al-22, and 
Al-26 probably have reduced quenching 
as compared with Al-7. Still, the degree 
of quenching is sufficient to detect a 
highly significant increase in reporter 
fluorescence when each of these probes 
is cleaved during PCR, 

To further investigate the ability of 
TAMRA on the 3' end to quench 6-FAM 
on the 5' end, three additional pairs of 
probes were tested in the 5' nuclease 
PCR assay. For each pair, one probe has 
TAMRA attached to an internal nucle- 
otide and the other has TAMRA attached 
to the 3' end nucleotide. The results are 
shown in Table 2. For all three sets, the 
probe'with the 3' quencher exhibits a 
ARQ value that is considerably higher 
than for the probe with the internal 
quencher. The RQ - values suggest that 
differences in quenching are not as great 
as those observed with some of the Al 
probes. These results demonstrate that a 
quencher dye on the 3' end of an oligo- 
nucleotide can quench efficiently the 



TABLE 2 Results of 5' Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or ^-terminal Nucleotide 



518 nm 



582 nm 



Probe 


no temp. 


+ temp. 


no temp. 


+ temp. 


RQ- 


RQ + 


ARQ 


A3-6 
A3-24 


54.6 ± 3.2 
72.1 ± 2.9 


84.8 ± 3.7 
236.5 ± 11.1 


116.2 ± 6.4 
84.2 ± 4.0 


115.6 ±2.5 
90.2 ± 3.8 


0.47 ± o;o2 
0.86 ± 0.02 


0.73 ± 0.03 
2.62 ± 0,05 


0.26 ± 0.04 
1.76 ± 0.05 


P2-7 
P2-27 


82.8 ± 4.4 
113.4 ±6.6 


384.0 ±34.1 
555.4 ± 14.1 


105.1 ± 6.4 
140.7 ± 8.5 


120.4 ± 10.2 
118.7 ± 4.8 


0.79 ± 0.02 
0.81 ± 0.01 


3.19 ± 0.16 
4.68 ± 0.10 


2.40 ± 0.16 
3.88 ± 0.10 


P5-10 
P5-28 


77.5 ± 6.5 
64.0 ± 5.2 


244.4 ± 15.9 
333.6 ±12.1 


86.7 ± 4.3 
100.6 ± 6.1 


95.8 ± 6.7 
94.7 ± 6.3 


0.89 ± 0.05 
0.63 ± 0.02 


2.55 ± 0.06 
3.53 ± 0.12 


1.66 ± 0.08 
2.89 ± 0.13 



Reactions containing the indicated probes and calculations were performed as described in Material and Methods and in the legend to Fig. 2. 
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fluorescence of a reporter dye on the 5' 
end. The degree of quenching is suffi- 
cient for this type of oligonucleotide to 
be used as a probe in the 5' nuclease PGR 
assay. 

To test the hypothesis that quenching 
by a 3' TAMRA depends on the flexibility 
of the oligonucleotide, fluorescence was 
measured for probes in the single- 
stranded and double-stranded states. Ta- 
ble 3 reports the fluorescence observed 
at 518 and 582 nm. The relative degree 
of quenching is assessed by calculating 
the RQ ratio. For probes with TAMRA 
6-10 nucleotides from the 5' end, there 
is little difference in the RQ values when 
comparing single-stranded with double- 
stranded oligonucleotides. The results 
for probes with TAMRA at the 3' end are 
much different. For these probes, hy- 
bridization to a complementary strand 
causes a dramatic increase In RQ. We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
stranded DNA, which prevents the 5' 
and 3' ends from being in proximity. 

When TAMRA is placed toward the 3' 
end, there is a marked Mg 2+ effect on 
quenching. Figure 3 shows a plot of ob- 
served RQ values for the Al series of 
probes as a function of Mg 2 * concentra- 
tion. With TAMRA attached near the 5' 
end (probe Al-2or Al-7), the RQ value at 
OffiMMg 2 ' is only slightly higher than 
RQ at 10 mM Mg 2 *. For probes Al- 19; 
Al-22, and Al-26, the RQ values at 0 mM 
Mg 2 * are very high, indicating a much 



reduced quenching efficiency. For each 
of these probes, there is a marked de- 
crease in RQ at 1 nriM Mg 2 * followed by 
a gradual decline as the Mg 2 * concen- 
tration Increases to 10 mM. Probe Al-14 
shows an intermediate RQ value at 0 mM 
Mg 2 * with a gradual decline at higher 
Mg 2 * concentrations. In a low-salt en- 
vironment with no Mg 2+ present, a sin- 
gle-stranded oligonucleotide would be 
expected to adopt an extended confor- 
mation because of electrostatic repul- 
sion. The binding of Mg 24 ions acts to 
shield the negative charge of the phos- 
phate backbone so that the oligonucle- 
otide can adopt conformations where 
the 3' end is close to the 5' end. There- 
fore, the observed Mg* + effects support 
the notion that quenching .of a 5' re- 
porter dye by TAMRA at or near the 3' 
end depends on the flexibility of the oli- 
gonucleotide. 

DISCUS5ION 

The striking finding of this study is that 
it seems the rhodamine dye TAMRA, 
placed at any position in an oligonucle- 
otide, can quench the fluorescent emis- 
sion of a fluorescein (6-FAM) placed at 
the 5' end. This Implies that a single- 
stranded, double-labeled oligonucle- 
otide must be able to adopt conforma- 
tions where the TAMRA is close to the 5' 
end. It should be noted that the decay of 
6-FAM in the excited state requires a cer- 
tain amount of time. Therefore, what 



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and 
Double-stranded Fluorogenic Probes 



518 nm 



582 nrn 



RQ 



Probe 



ss 



ds 



ss 



ds 



ss 



ds 



Al-7 

Al-26 

A3-6 

A3-24 

P2-7 

P2-27 

P5-10 

P5-28 



2775 
43.31 
16.75 
30.05 
35.02 
39.89 
27.34 
33.65 



68.53 
509.38 

62.88 
578.64 

70.13 
320.47 
144.85 
462.29 



61.08 
53.50 
39.33 
67.72 
54.63 
65.10 
61.95 
72.39 



138.18 
93.86 
165.57 
140.25 
121.09 
61.13 
165.54 
104.61 



0.45 
0.81 
0.43 
0.45 
0.64 
0.61 
0.44 
0.46 



0.50 
5.43 
0.38 
3.21 
0.58 
5.25 
0.87 
4.43 



fss) Sinale-stranded. The fluorescence emissions at 518 or 582 nm fox solutions containing a filial 
hm indicated probe, 10 mM Tris-HCl <pH 8.3), 50 mM KC , and 10 mM MgQ, 
S&tnM me solutions contained,*, addition, 100 nM A1C for probe, Al-^and 
Al-26 100 nM A3C for probes A3-6 and A3-24, 100 nM P2C for probes P2-7 and P2-27, or 100 dm 
Si »ta »iE3p«* Before the addition of MgCl 2 . 120 4 of each sample was heated 
S£ £££ Following the addition of 80 >1 of 25 mM MgCl, 

cool to room temperature and the fluorescence emissions were measured. Reported values are 
the average of three determinations. 
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matters for quenching is not the average 
distance between 6-FAM and TAMRA 
but, rather, how close TAMRA can get to 
6-FAM during the lifetime of the 6-FAM 
excited state. As long as the decay time of 
the excited state is relatively long com- 
pared with the molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMRA at the 3' 
end, or any other position, can quench 
6-FAM at the 5' end because TAMRA is in 
proximity to 6-FAM often enough to be 
able to accept energy transfer from an 
excited 6-FAM. 

Details of the fluorescence measure- 
ments remain puzzling. For example, Ta- 
ble 3 shows that hybridization of probes 
Al-26, A3-24, and P5-28 to their comple- 
mentary strands not only causes a large 
increase in 6-FAM fluorescence at 518 
nm but also causes a modest increase in 
TAMRA fluorescence at 582 nm. If 
TAMRA Is being excited by energy trans- 
fer from quenched 6-FAM, then loss of 
quenching attributable to hybridization 
should cause a decrease in the fluores- 
cence emission of TAMRA. The fact that 
the fluorescence emission of TAMRA in- 
creases indicates that the situation is 
more complex. For example, we have an- 
ecdotal evidence that the bases of the 
oligonucleotide, especially G, quench 
the fluorescence of both 6-FAM and 
TAMRA to some degree. When double- 
stranded, base-pairing may reduce the 
ability of the bases to quench. The pri- 
mary factor causing the quenching of 
6-FAM in an intact probe is the TAMRA 
dye. Evidence for the importance of 
TAMRA is that 6-FAM fluorescence 
remains relatively unchanged when 
probes labeled only with 6-FAM are used 
in the 5' nuclease PGR assay (data not 
shown). Secondary effectors of fluores- 
cence, both before and after cleavage of 
the probe, need to be explored further. 

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a 
double-labeled fluorescent probe in the 
5' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the 
value of RQ . r which is the ratio of re- 
porter to quencher fluorescent emis- 
sions for a no template control PCR. In- 
fluences on the value of RQ" include 
the particular reporter and quencher 
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mM Mg 

FIGURE 3 Effect of Mg 2+ concentration on RQ ratio for the Al series of probes. The fluorescence 
emission intensity at 518 and 582 nm was measured for solutions containing 50 nM probe, 10 mM 
Trls-HCl <pH 8.3), 50 mM KG, and varying amounts (0-10 mM) of MgCi 2 . The calculated RQ 
ratios (518 nm intensity divided by 582 nm intensity) are plotted vs. MgCI 2 concentration (mM 
Mg). The key (upper right) shows the probes examined. 



dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 
context effects, presence of structure or 
other factors that reduce flexibility of 
the oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe T m , presence of secondary struc- 
ture in probe or template, annealing 
temperature, and other reaction condi- 
tions. The third factor Is the efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter arid quencher 
dyes drastically reduce the cleavage of 
probe/ 0 

The rise in RQ" values for the Al se- 
ries of probes seems to indicate that the 
degree of quenching is reduced some- 
what as the quencher is placed toward 
the 3 r end. The lowest apparent quench- 
ing is observed for probe Al-19 (see Fig. 
3) rather than for the probe where the 
TAMRA is at the 3' end (Al-26). This is 
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 
of an internal position. In effect, a 
quencher at the 3' end is freer to adopt 
conformations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQ~ values 
is less clear-cut. The A3 probes show the 
same trend as Al, with the 3' TAMRA 
probe having a larger RQ" than the in- 
ternal TAMRA probe. For the P2 pair, 
both probes have about the same RQ" 
value. For the PS probes, the RQ" for the 
3' probe is less than for the internally 
labeled probe. Another factor that may 
explain some of the observed variation is 
that purity affects the RQ" value. Al- 
though all probes are HPLC purified, a 
small amount of contamination with 
unquenched reporter can have a large ef- 
fect on RQ". 

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe Al-2, where place- 
ment of the TAMRA on the second nu- 
cleotide reduces the efficiency of cleav- 
age to almost zero. For the A3, P2, and PS 
probes, ARQ is much greater for the 3' 
TAMRA probes as compared with the in- 
ternal TAMRA probes. This is explained 
most easily by assuming that probes 
with TAMRA at the 3' end are more likely 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached internally. For the Al probes, 
the cleavage efficiency of probe Al-7 
must already be quite high, as ARQ does 
not increase when the quencher is 
placed closer to the 3' end. This illus- 



trates the importance of being able to 
use probes with a quencher on the 3' 
end in the 5' nuclease PGR assay. In this 
assay, an increase in the intensity of re- 
porter fluorescence is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes on the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (Al-7) and other times 
not so well (A3-6). The relatively poor 
performance of probe A3-6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between the 
reporter and quencher. Therefore, the 
best chance of having a probe that reli- 
ably detects accumulation of PCR prod- 
uct in the S' nuclease PCR assay is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the T m 
of a probe. In fact, a 2°C-3°C reduction 
in T m has been observed foT two probes 
with internally attached TAMRAs. w This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of increased cleav- 
age and hybridization efficiencies means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This, tol- 
erance of mismatches can be advanta- 
geous, as when trying to use a single 
probe to detect PCR-amplified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations in an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disad- 
vantage is for allelic discrimination. Lee 
et al. u> demonstrated that allele-specific 
probes were cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target. This al- 
lowed them to distinguish the normal 
human cystic fibrosis allele from the 
mutant. Their probes had TAMRA 
attached to the seventh nucleotide from 
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the S' end and were designed so that any 
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target. Thus, probes 
with a quencher attached to an internal 
nucleotide may still be useful for allelic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, could 
also be used to determine whether hy- 
bridization has occurred or not. Thus, 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes. The ability to detect hybrtdiza- 
tion in real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybrid- 
ization assays for diagnostics or other ap- 
plications. Bagwell et al.< 10 > describe just 
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two imperfect 
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PCR amplifi- 
cation. 
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Quantitative nucleic acid sequence mialysis has 
had an important ml* in many fields of biologi- 
cal research. Measurement of gene expression 
(RNA) has been used extensively In monitoring 
biological responses to various stimuli (Tan el al, 
1991; Huanft el al. I995a,b; Prud'homme et al. 
1995). Quantitative gene analysis (T;NA) has 
ln-en used to d«iermine the gen tin u.* quantity of a 
particular gene, as in the case, or the human HER2 
gene, which Is amplified in -30% of breast tu- 
mors (Slarnon ct a!. 1987). Gene and genome 
quantitation .(DNA and UNA) also have been used 
fur analysis of human immunodeficiency virus 
(HJV) burden demonstrating changes in the lev- 
els of virus throughout the different phases of the 
disease (Connor et al. 1993; Matak ct al. ]vv:hi; 
Purtado <?l ai. 1W5)- 

Many methods have been described for rtu: 
quantitative analysis ot nucleic acid sequences 
(both for UNA and DNA; Southern VJ/b; Sharp ct 
al. 1980; Thomas 1980). Recently, PGR lias 
proven to be a powerful tool for quantitative 
nucleic acid analysis. PCR and reverse transcrip- 
tase (KT)-PCR have permitted the analysis of 
minimal starting quantities of nucleic acid (as 
little as one cell equivalent). This has made pos- 
sible many experiments that could not hove, been 
performed with traditional methods. Although 
PCR has provided a powerful tool, it is imperative 
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that it be uaal properly for quantitation (U»«y- 
maskers 1995). Many early reports of quantita- 
tive PCK and RT-PCR described quantitation of 
the PCR product but did not measure, the Initial 
target sequence, quantity. II is essential to design 
proper controls for the quantitation of the initial 
target sequences (Hcrrc 1992; Clcmentl ct al. 
100?.) 

KcNMirchexs have, developed several methods 
of quantitative PCR and HT-FCR. One approach 
measures PCR product quantity in the log phase 
of the re»ci ion before the plateau (Kellogg et al. 
]000; Pang et a). 1 W0). This method requires 
lhai each sample has equal input amounts of 
• nucleic- add and that each sample under analysis 
amplifies with ident leal efficiency up to the. point 
of quantitative analysis. A gene sequence (con- 
tained In all samples at relatively constant quan- 
tities, such a.s p-aelln) tan be used for sample, 
amplification efficiency normalization. Usin« 
conventional methods of PCU detection and 
quantitation (gel electrophoresis or plate capture 
hybridization), it is extremely laborious to assure 
that all samples are analyzed during the log phase 
of the reaction (for both the target gene and the 
normalization gene), Another method, quanllta- 
live competitive (QQ-KIR, has been developed 
and i* »sed widely for PCR quantitation. CXM'CR 
relics on the inclusion of an internal control 
competitor in each reaction (Becker-Andre 1991; 
Platan el al. 1»IW>)- The efficiency of each re. 
action is normalised to (he Internal competitor. 
a wnnwn amount of Internal competitor can be 
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! added to each sample. To obtain relative riunnU 

V ration, the unknown target PGR premium is com- 
! pared with the known competitor K".K product, 
^ Success of a quantitative competitive l*CU assay 
t reli«s on developing an Internal control that am- 
l pYifivs with the same efficiency as the Utgvl 

1 cculc. The design of The coiupetJtux and the va)l- 

V datlon of amplification efficiencies icquirc a 
s^i dedicated effort . However, because QC -PC :r does 

nut require that PC:K punlucts be analyzed during 
t ' the log phase of the amplification, it is tin: easier 

erf the two methods to use. 

Several detection systems «ie used for quan 
Utalive PCX and RT-PCtt analysis; 0) as 5 " 1 ** 0 
gels, (2)iluurescenl labeling of PCU products and 
Ctclccilon with tn.iiTT- induced fluorescence using 
capillary electrophoresis (Pasco et al. 1995; Wil- 
liams et al. 1996) or acrylanildc gels, and (3) plate 
capture and sandwich piobe hybrid I /.a lion (Mul- 
der el al. 1994). Although these method* pmvrd 
successful, each method requires posl-PCR ma- 
nipulations that add Time to the analysis and 
may lead to hibumtuty i on I a mi nation . The 
sample Qiroughpul of these methods i.s limited 
(wi|]i the rxccpllon of the plate capture ap- 
proach), and, thertrfnre, these method* art.* not 
well suited fin u-so dciiicwidin£ high snmplc 
Throughput (I.e., screening of huge numbers of 
hliuiu>ltt\.ultr> oi analy/in^ SAmplus i'ux diagnus- 
N.'* tic* or cl in ico I trials). 

Here we report the development of it novel 
assay for quantitative T^NA analysis. The assay is 
based on Die us«-t>f the 5' nuclease assay first 
described by Holland et al. (1991). The method 
uses the -V nuclease activity of 7Vu/ polymerase to 
cleave a noncxtcndlblc hybridization probe dur- 
ing the extension phnse of 1'CU- The approach 
uses dual-labeled fluorogcnic hybridisation 
probes (Lcc ct a). 1993; nasslcr ct al. 1995; Uvok 
et al, Jy9fjo,b)- One fluorescent dye svrvv.3 as rt 
reporter |PAM (i.e., (^carbpxyfluoresvcin)l and its 
emission spectra is quenched by the second fluo- 
rescent dye, TAMRA (i.e., n-cai-boxy-tetramethyl- 
rhodamtnc). The nuclease degradation of the hy- 
brtdi/Jitton probe releases the quenching of Die 
I'AM fluorescent emission, resulting in an In- 
crease in peak fluorescent emission at S3« wtk 
The use Of a sequence detector (All! Prism) allows 
measurement of fluorescent spectra of all 96 wells 
uf The thermal cycler continuously during the 
1*CK amplification. Therefore, the reactions aje 
monitored in real Lime, The output data is de- 
scribed and quantitative analysis of input target 
I )NA sequences is discussed below. 
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RESULTS 

PCR Produce Derecrlon in Rwai Time 

The goal was to develop a high-throughput, sen- 
sitive, and accurate gene quantitation assay for 
use In monitoring lipid mediated therapeutic 
gene delivery. A plasm id Amending human factor 
Vlll gene sequence, pl'8TM (see. Methods), was 
used as a model therapeutic rwui. The assay 
fluorescent Taqman methodology and an instru- 
ment capable of measuring fluorescence in real 
time (AB1 Prism 77(H) Sequence Dcicrinr). Hie 
Taqman reaction requires a hybridisation jwnhr 
lalxlcd with two different fluorescent dyes. One 
dye Is a rcporUir dyu (PAM), the otfw is * quench- 
ing uye (TAMRA). When tlie proU: is intact, fluo- 
icaccnl energy transfer occurs and the reporter 
dye fluorescent emission h absorbed by the 
quenching dye (TAMUA'j. During the extension 
phase of the PCR cycle, the fluorescent hybrid- 
i/jillon prone Is cleaved by the S'-.V nuelcolytic 
activity of the DNA polymerase. On cleavage of 
the probe, the reporter dye emission Is no longer 
transferred efficiently to the quenching dye, re 
sultiiiK In an increase of the reporter dyo fluores- 
cent emt.'wion hp*ctra. VCll primers untl probuw 
were designed ft*i the human factor Vlll se- 
quence and human p-«e.tin gene (a* dv.^c.ril>ed in 
McthtMla). Optimization reactions were per- 
formed to choose the approprlute probe untl 
-magnesium concentrations yielding the highest 
Intensity of reporter fluorescent signal without 
sacrificing specificity. The Instrument uses a 
charKu-coiipIcd device (i.e.. CCD . cmncru) for 
measuring the fluorescent emission spectn> from 
. r ,(K) tfi MO nm. ICach 1*CR tube was monitored 
sequentially f<?r 2& m.Huc wltli continuous moni- 
toring throughout th« aniplifieatipn. Uoci^ lube 
wo.n rn-cxanilncd every B.5 sec. Computer soft- 
ware, was designed to examine the flu orescent In- 
tensity of both the reporter dye (PAW) and 
the quenching dye (TAMRA). The. Ihioresccnt 
intensity of the quenching dye, TAMUA, changes 
very little over the course of the PCR ampllfl* 
cation (data not shown). Therefore, the Intensity 
of TAMKA dye omission serves hs »n hitcriuil 
.itondard with which to nonnullW! the reporter 
dye (PAM) emission variatJons. n^e .software cal- 
culates «i value termed AHn (or AfcQ) using the. 
following equation: ARn - (lln J ) (Hn'). where 
Kn 4 emission intensity of reporter/emission in- 
tensity of quencher al any given time In a -reae 
rion lube, and f emission intensility of tc- 
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poner/emission ImwiMty »r quencher measured 
prior U; lO iimpMicaUon in ihat same ruction 
tube. Tor the purpose of quantitation, the U« 
three data points (ARns) collected during the. ex* 
tension step for each PCR cycle were analyzed. 
The nucleolyiic degradation of the nyoriUiy-ttion. 
probe occurs during ihe extension phas*of rial, 
and, therefore, reporter fluorescent cni»Mun in- 
creases during this time, Hit: llnce dalfl points 
were averaged for cadi cycle and the metm 
value for each was plotted in an "amplification 
plot" shown in J^urc J A. The ARn mean value \s 
plowed on the J'-axls, and time, represented by 
cycie number, is plotted on thv*-axis. During the 
early cycles of the VCR amplification, die ARn 



value remain!; at base line When .sufficient hy- 
bridization probe has been cleaved by the Tim? 
ix>lymvrasc nucleate activity, the intensity of re- 
porter fluorescent emission liivreaiwfr. Most PCK 
aniplifiudions reach a plaUau pbaKc of reporter 
fiuores»ee.h'i emission if the reituliun Is carried nut 
to high cycle nujiilwis. The amplification plot 10 
examined vuiiy in Urn reaction, at a point thai 
ic-j>jcscnts ihe log phase of product arminula* 
turn. This Js done by assigning an arbiUary 
ihrcshoRl thul Ls bawd on the variability of the 
base-line dyu. In Figures U, tlic threshold w*sxct 
at 10 standard deviations above, ihc mean of 
base line emission calculated from tydo 1 lo 1 5. 
Once the threshold is chosen, the point at which 
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the amplification plot sztoum the threshold' u> &<* 
fined as C r . C r is reported aa the cycle number ;n 
this point. Ar will be demons! rut «d, th« CI, .value 
1st picUlcUve of the quantity of input Urge!. 

C,- Values Provide a Quantitative Measurement* of 
input Target Sequences 

Figure IB chows amplification pluU of lii'tMTs*. 
cut PGR amplifications overlaid. 'i*hc amplify 
tions were performed on a 1 :2 serial dilution 'itf. 
human genomic 1WA. 'ITic amplified tatgei vwu 
human p octln. The amplification plots «hifl to 
the right (to higher threshold cycles) n.< the input 
target quantity reduced. 'JTvif. is oxpoctod ho- 
ramxu wmtrtlortK with fewer starting eopiitx of the 
target molecule require greater amplification to 
degrade enough probe to attain the Threshold 
fluorescence. An arbitrary threshold of 10 'stan- 
dard deviations above the base line was used to 
determine trie O r values. Figure 1C represents the 
C r values plotted versus the sample dilution 
value. Each dilution was amplified in triplicate 
PC:r amplifications and plotted as mean values 
with error bars representing one standard devia- 
tion. The C T values decrease linearly with increas- 
ing target quantity. Thus, C r values can be used 
as a quantitative measurement of tlie input target 
mimbe.T. It should be noted that the amplifica- 
tion plot for the 15.6-ng sample shown In Figure 
IB does not reflect the same fluorescent rate of 
increase exhibited by most of the other samples. 
The 15.6-ng sample also achieves endpoinl pla- 
teau at a lower fluorescent value thai) would he 
expected based on the input ON A. This phenom* 
enon has been observed occasionally with other 
samples (data not shown) and may be attribut- 
able to late cycle inhibition; this hypothesis is 
still under investigation. It is important to note 
that tlie flattened slope and ear)y plateau do not 
impact significantly the calculated value as 
demonstrated by the fli on ihr line shown in 
Figure 1 C, All triplicate amplifications resulted in 
very similar Cr values — the standard deviation 
did not exceed 0,5 for any dilution. Tills experi- 
ment contains a > 1 00,000-fold range of Input tar- 
get molecules. Using G r values for quantitation 
permits a much larger assay range than directly 
using total fluorescent emission intensity for 
quantitation. The linear range. ol iluoresccnt in- 
tensity measurement of the A1U I'rlstn 7700 Se- 
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rmmts over n very large t;ii»|»p nf rfUtivn starting' 
target quantities. 

Sample Preparation Validation 

Several parameters influence the ofhcU-nry nf 
PCR umplification: magnesium and salt coneeiv 
nations, reaction conditions (i.e., time and tem- 
perature), PCH target size and composition, 
primer sequences, and sample purity, All of Tlie 
above (actors are common to a single J'CK assay, 
except sample to sample purity. In an effort to 
validate the. method of sample preparation for 
the iactor VJ11 assay, FCK amplification reproduc 
ihih'ty and olficiency ol 10 replicate sample 
]iref mirations were, examined. After genomic DNA 
was prepared from tlie 10 replicate samples, the 
DNA was quaiHUatcd by ultraviolet spectroscopy. 
Amplifications were performed analyzing p-aciln 
gene, content in 100 and 25 Jig of total genomic 
DNA. Each I'CR amplification was performed in 
triplicate. Comparison of C r values for each trip- 
licate sample shim minimal variation based on 
standard deviation and coefficient of variance 
(Tabic I). Therefore, each ol the triplicate PCU 
amplifications was highly reproducible, demon- 
strating that real time PCR using this instrumen- 
tation introduces minimal variation Into th« 
quantitative J'CK analysis. Comparison of tlie 
mean C n values of the 10 replicate sample prepa- 
rations also showed minimal variability, indicat- 
ing 111 at each sample preparation yielded similar 
results for ft-acltn gene quantity. The highest C. T 
difference between any of the samples was 
and 0.73 for the ](K) and 25 ng samples, respec- 
tively. Additionally, the amplification of each 
sample exhibited an equivalent rate of fluores- 
cent emission intensity change per amount of 
DNA target analyzed as indicated by similar 
slopes derived from the sample dilutions (Pig. 2). 
Any sample containing an excess of a l'CK inhibi- 
tor would exhibit a greater measured 3-actln C r 
value for a given quamliy of DNA. In addition, 
the inhibitor would be diluted along with the 
sample in the dilution analysis (lig. 2), altering 
the expected C,. value change. Each sample am- 
plification yielded a similar result in the analysis, 
demonstrating that this method of sample prepa- 
ration is highly reproducible, with regard to 
sample purity. 

Ouancitadve Analvsis of a Plasmid After 
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(or containing a partial cUNA for lumwn factor 
vni, pi-BTM. A scries of transfcciiom was sot 
up using a decreasing amount of the plasmid\40, 
A t 0.5, and 0.1 u.g), TwtMny-four hours posl- 
transfet'tinn, total DN7A wn> purified from each 
flask uf L-rlla. p-Aclin gene tjuanlily wa> Ukoui 
a value Tor noimali^ai iwii of Kcuumii'. ONA con- 
ccnrniUoii from cadi sample. In this cxpciimont, 
li-actin gene content should remain constant 
relative to rural genomic UNA. Figure 3 shows the 
result of the p-aetln DNA measurement (100 jig 
total DNA determined by ultraviolet spectros- 
copy) Of each sample. Kach sample was analyzed 
hi triplicate and the mean (i-actin Of values of 
the triplicates were plotted (error bars represent 
r*w\**ri M^i^noni *1h#» hlP)i#»sr iiiffcrrnrr 
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betwywn any two sample* moam was 0.9S Ten 
nanograms of total UNA uf each sample were also 
examined ror jVactln. Ihc results «&uJo showed 
that very similar amounts of genomic UNA were 
present; tin-. maximum mean p actio value 
difference wa.s 1.0. As Figure 3 shows, the rale of 
p-aeti» C r diuriKv bclween the 100 and 10-ng 
siunpl« was similar (slope values rang« btttwocm 
3,56 and -3.45). This verifies again that the 
method of sample preparation yields saynplos of 
identical PCR integrity (>.<•-, no sample contained 
an excessive amount uf a VCR inhibitor). How. 
ever, these results; indicate that each sample con- 
tained slight diffeiences in the actual amount of 
genomic ON A analysed. Determination of actual 
KUiiujinc »>NA concent ration wos accomplished 
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Figure 2 Sample preparation purity. 1 he replicate 
samples shown In Tablo 1 wore also amplified In 
tripicate Rising 25 rig of each DNA sample. The fig- 
uic shows the input DNA concentration (100 and 
25 ng) vs. C, In ihr* linurp. th*> 100 and 75 ng 
points for each sample are connected by a line. 



by plotting the mean P-actio C, x value obtained 
for tfatih 1i)0'-llg SHiliplv wn a ft-aciln standard 
truive (shown In J'Sh- 4 ^>- actual genomic 
ON A concentration «f each sumpl«, w, was ob 
tallied by extrapolation to the X-axis. 

Figure 4 A shows the .measured ({."., 
normallr.ed) quantities, of factor VJJ) pin am id 
DNA (pFSTM) from each of the four transient cell 
transections. Each reaction contained 100 ng of 
total sample DNA (as determined by UV spvutro* 
copy). l r -ach sample was analyzed in triplicate 
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Figure 3 Analyst of lidiisfectcd cdl DNA quantity 
and purity. I he DNA preparations of the lour 293 
cell transections (40, A, 0.5, and 0.1 ng of pF8TM) 
were analyyed for the P-actln gene. 100 and 10 ng 
(determined by ultraviolet spectroscopy) of each 
sample were amplified in triplicate. For each 
amount of pFSTM that was transfected, the p-actln 
C T values are plotted versus the total Input DNA 



pc;r< amplification*. As shown, pPBTM purified 
jftoic Jbc 293 cells decreases (mean C, values in- 
c*tuii*£) with decreasing amounts of plasmld 
.truitsU'Ucd- The mean C"^ values obtained for 
pWTW inTlgufC 4A were plotted on a standard 
curve comprised uf seilally diluted pKHTM, 
shown .in figure 4ft. The quuiillly oJ pl-KTM, U, 
found in each of the four tranfifcctlonK was de- 
termined by extrapolation to the x uxte of the 
standard curve. In figure 41*. Thcrsc uncorrected 
values, b, for pHfi'M were itoriiHril/ud to del er- 
mine the actual amount of pl'81^1 found per 100 
n« of genomic ONA by lining the equation:. 

b X 100 ng uctual pl : 8TM copies oer 
r 100 ng of genomic DNA 

where a •- actual -genomic HNA in u sample and 
U w- pI'HTM copies from the standard curve, The 
normalised quantity of pt'BTM per 100 ng of ge- 
nomic DNA for each erf the four 1 ranafccilons Is 
shown in Figure 'n<c-Ne results show ihflt the 
quantity of factor Vlll plasmiU associated wnh 
Xftc 293 cells, 24 hr after i run si-fuci'inn, di:ui:.isc:> 
with decreasing plasiuu) uinuunuatjou used in 
the ifan^fcdion. llu: quantity of pl ; «*J'M nsaoeJ- 
atcu wim 293 cells, after trunsfeciloji whh 40 
of pi;isniid, was 35 pg per 100 ng genomic UNA. 
This results in -520 plasuiiU copies per cell. 



DISCUSSION 

We have described a new method for qunntilnt- 
iug gene copy numbers using rtaMlmc analysts 
of PCJR amplifications. Real-time PCK is compat- 
ible with cither of the two PC:K (KT-PCR) ap- 
proach^: (1) quanmative conM*« ln ' lvi: w " c,c An 
Inteuial witipclilOf for each target sequence is 
used for normalisation (data not shown) or (2) 
quantitative comparative 3*011 using \\ nunii*li*a- 
ttou gene contained within the sample (i.e., |3-ac- 
tiii) or a "housttkeeping" gene for UT-PCK- If 
equal amounts of tmcielc add are analy/wl for 
each sample and if the amplification efficiency 
before quamltnttve analysis ^ identical for each 
sample, the Internal cud I ml (nui mali/^ilion gene 
or competitcjr) should give equal signals for nU 
samples. 

The real-time PCH method offers several ad- 
vantages over the other two methods currently 
employed (t>eo Ibc introducUon). I ; irst, the Teal- 
time PGR method is performed in a do.scd-Uibe 
system and requires no past -PCK numipulatlon 
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Flpiirft 4 Quantitative analyiii* of pFSTM in Iran dec Led cells. (A) Amount of 
plasmid DNA used for I he trunsf action plotted against the mcun C t value deter- 
mined for pfSTM remaining fa br attcr Uansfectfon. (D,C) Standard curves, of 
pf-ftYM ^nd fi-acUn, respectively. pfBTM DNA <fl) and genomic DNA (Q were 
diluted SArlally 1:5 before amplification with the appropriate primer*. The p-aclin 
standard cuivq wai; used lo normalise the results of A to 100 ny of genomic DNA. 
(0) The amount of pPSTM present per 100 ng of genomic DNA. 



of Mm p)c. Therefore, I In* potential) for TCH con- 
tamination in the laboratory is reduced because 
amplified products can Ik* analysed and disposed 
of without opening the r turn ion tubes. Second, 
this method suppozU the UMt of a i u/rin;i]jxiilk>iJ 
gene (i.e., |3-actin) for quantitative PGR or house- 
keeping gene."; for ipsuntitnttvc KT-l'CK controls. 
Analysis is performed in real time during the Jog 
phase of product accumulation. Analysis during 
k»K phase permits many different genes (ova a 
wide input target range) lo be anaJy/cd slmulia- 
iu:uu*i]y, without concern of reaching rend ton 
plateau at different cycles, Tim will make imilll- 
gene analysis assays much c.aMv.i Uj develop, be- 
cause individual internal t.wmpetllui> will not be 
necded for each gene under analy&ht. Third, 
^ctinplc throughput will htne<i:>c dramatically 
with the new method because, there is no post- 
rc.;K prncensing time. Additionally, winking In a 
^6-we.il format \n highly compatible with auttv 
million technology. 

The real-time 1>CR method is highly reprn- 
ducible. ft e plicate amplifications can be analyzed 



for ^.ich sample minimizing ]>otcntlttl error. The. 
sysutin allow* feu a very large assay dynamic 
range (approaching 1,000,000 -fold Mailing Uti- 
gel). UmIiik a standard curve for the. target oi in* 
terest, relative copy number values can be deter- 
mined for any unknown *un;pic. Flu orescent 
threshold values, C p condair. linearly with rela- 
tive DNA copy numbers. Heal time quantitative 
HT-PCK methodology (Cilbsuji et al., tills i.wuti) 
ha.i also been do. vd oped, finally, real tlm* quan- 
titative I*CU methodology can be used to devdup 
high-throughput weening aaaay.% for a variety of 
applications [quantitative gcn<; CAf'jea&iou (KT- 
PCft), gene copy aaanyn (Mcr2, I11V, etc.), ,gcni> 
typlng (knockout mouse analysis), and lmmuiio- 

renj. 

Real- time K-ll may al.w he j>crformeil using 
intercalating dye.s (Higuchi ct al. such «s 

oiJvidium bromide. The fluorogenic prone, 
method offers a major advantage over inter- 
calating dyes- greater specificity (i.e., primer 
dimvrs and nonspecific. PCR products are noi de- 
tected). 
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METHODS 

Generation of <t PUsmid Containing a Partial 
cDNA for Human Factor Vlll 

Total RNA v»<»> harvcMcd (UNA*ol » from Tel Test, lue., 
hrjendawood, -TX) from o-M> li-nafecled with 4 Jnetur VHl 
ramnsiJuii vector, pC:iS2.tfe4&» (Kotcm el hI. Cor. 
mnn el al. 1900), A factor VIII partial cMNA wiimthv WAS 
K «-m.micd by irj' \>CM KJoneAmp |<Z iTlh ItNA POl kit 

(pan NWirwnvs, rt Applied itiosysicms, l'cwivi <'<t*y, <-^)J 

uslnft I Ik* POl primers 1-Vfor hiu! Ittrev <|«rin«-r sequences 
arc shown Mow). The ampllcon was reamplinVd ittlnR 
modified l^for and Wrcv primers tsppomleci wlih Hem/Ill 
and HrnclUI restriction site sequence* m «Jiv V emO 
Clonal Into pCiKM- 3Z (IVohk^u CUvp.. MihIligd, Wl). The 
rcsirtllnKclmic, pVSTM, was u»rtl lor transient transection 
of Jt5l3 cells. 



Amplification of Target DNA anil Dclccilon of 
Amplicon Factor VIII Ptamid DNA 

(pOTM) was wnirtlftcrt with lire |»»"»«» FHfof 5'-c:<;c:- 

<rrcKX^\ACiAu:jxjAtxiiCi*iW3* and i : *rw .v-aaacct- 

tUOCXn'GCiAJXitiTAOCi-a'.Hie rvitcLUui pruJuml w 422- 
np K;k product. The forwurcl primer wti» de>b;iied lu let- 
ngnlxc u unique M'ljunm* nmnd In the V untranslated 
region »>f »hu paienl uOS2.lHZ5l> plaMiml nml Mn:(cforc 
Oovs not jvuim"*^ amplify foetnr VIM 

gene. Primnrfi wore choium with the av»ivt<H'r*r* of I he com- 
puter program Olifto (Nuliiwiiil ltUwcicnves, lne„ Ply- 
mouth, MN). The Amman p-actln gene wws amplified wllh 
ihc primer* »-t«-liti forward primer S ' T<JACCOACA< 7Vi ST 
GCCCATCTI'ACOA-.V and p-acliu it-wrtt piirner V.CAO- 

C0GAACCCcri'<:A"n'(i<:cJiA"j'CG-3'. The reaction pro- 
ciueeo o *V5 hp K;u product. 

Amplification reactions (SO jJ) contained a DNA 
sample, H»x P<;« liuffe.r II (5 u.1), 200 n-M dA'ir, dCTP, 
<1GTT\ and 400 jtM dlJTP, 4 tnx< MgCI ? , l.ZS Units Am pi I 

t;tsa polymciwc, 0.5 unit AmpKrow uracil W-jjiy- 

i;n.iylu*v <UNG), 60 prnolv.- of each focloi Vlll jm-Iiik-i, unci 1£ 
ptiinlt* of iMt:h ft Actio pi I m or, '11 m icartUniK omlotucJ 
OllC Of the following cU'K'ctlnn prnlxvt (HMJ r*M mrh): 

j/Hj»n»b<- A'(i/AM)AC2crrcrrc:cu<:cri , <:crn , (rrmn , cJT- 

OCCTT(TAMRA)p 3' <iud p-itt-tin probe 5 r (TAM)AT(Jf.X:c;- 
XO'AMKAJCCOCCATCCCAT^p-.T where p indicates 
phosphorylation nnd X Indtrotcs a linker arm nucleotide. 
Rene lion Mk\% wen.* Mi<:mAmp Optical TuIks (part AUrtV 
IhtNROI 09X1, PcrWn iUiuur) tlwi wwv irottotl («t IVrWn 
ninicr) (o prevcul liyM from reflecting Tube cop> were 
similar MUfoAiMp tinps but specially dcsifiiicd to prc- 
yciii light scat Ivr'nig. All <>l lh<- PtIU vUm/iwtnul>lf* wcro svi>>- 
,Jii:vl 1>y PK Applied lliocyMcno (IWit (*!Wy, CA) cxeepl 
ihr furlor Vlll priiuefrs, wliiih w»*h- 5yr1tl1esly.nl at Cenvn 
lech, Inc. (South Trunclseo, CA). Ptohev wort* dc-sl^ned 
uMng the 01j;;r.' 4.0 software, following guUlpltiivs win* 
jiCMeci in mc Model 7700 .sequence Pvtenor hi.iiiuuieri( 
tuunuAl. HrlcHy, prube T„ »JmiiiUI be ftl Jcast 5 U C hlftbrr 
Uiaii 0»e anncunuK leni^Miurc u.icil durinj; ihrrmul cy- 
t'htigi primers shcs\iltl not fwnu m*Ij1v U«plexw*vith Hu- 
probe. 

Tlie thLTJi]4il cycling conditlotvs Ineluilvd 2 juln at 
50*C and 10 ruin al 9S a C. , n«aruial (.yeliilg prnm-clrd with 



rtactioiit wore |M2rfonued in tho K4odrl 7700 Sequence IV- , 
t«-tor (I»U ApplU'd Ulosj'Klwuiv), which coniahn * Oriw- 
Aiup H IU Sys»«m »3000. IU!a<:lion wutlition^ w<tp pro. 
grutllilteU ijm a Pwwor Maciutt»h VI 00 (Apple X^mpninr, 
Soma Uaru, t^V) linked diwly to the Model WOO .V- 
tjuciiw ixilffcior. Ana'y» u ^ aU w ** J » lw - 1 l«^' ,f » n ' ,fi nn 
tUv MHi tntruh computer. < Vtl taction and an5tlyclK mftwaro 
wiw devclo|wl « 10% Applied Wo«y*iums. 

Tramfection of Cells with Factor Vlll Coiwtruct 

J-'nur T17.S Oaslti of 293 cells (ATOO CRI. J57H), * human 
fetol kidney siwpentian cell line, were Rrnwii to R0% con- 
Oucney And transfctied ptrTCM. Celli were k^wii ^ thy 
Allowing mcdlni S0% HAM'S HI 2 withom C5HT, 50% U»w) 
lJucose J1wJIkw>'s modified Kaxle itiediuin (l)MIi.M) with* 
can glycine wiUi sodium bicarr>nnate, J0% leml Ik wine 
serum, 2 inm L-(;lui<«»i'0t, and 1% penidilin-slreptnmy- 
On, The media wm dwnffcd 30 mln M^«, tl,<* bansfee 
lion, plOTM DNA amounts of 40, 4, OS, nnd 0.1 pf; were 
iidilecl ti> 1..S ml of a solution ontalnlnR 0.125 m C^»a ? ; 
«nd 1 x MUCUS. The four mixhmrt were left al room tern- 
pcm«n' <<" ntin and them iKMinl rlnipwlM* U> il*c cells. 
Tliv n»>k* w.-if,irit.uUilcd al 37°C". and 5.% < "O^ f«r 24 hr, 
washed with PUS, ***tl nw u spcndcd In PllS. The rcmi.q 
jamhIv^I cells were divided into i»litjuni« uod UNA wfl4 cv- 
tmeted lunnediutely usinR the QIAump KUmkI Kit (Qupon. 
ai«l9wvrtl), C«A), l>NA wu.s e.lwUd tnlo 200 »»l of SO n.u 
TrMlCJulpIl H.0, 
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WISP genes are members of the connective tissue growth factor 
family that are up-regulated in Wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and W1SP-2, that are up-regulated in the 
mouse mammary epithelial cell line CS7MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (/) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracaine repressive promoter, and («) Wnt-1 transgenic 

mice. The WISP-1 gene was localized to human chromosome 

8q24.1-8q243. WISP-1 genomic DNA was amplified in colon 

cancer cell lines and in human colon tumors and its RNA 

overexpressed (2- to > 30-fold) in 84% of the tumors examined 

compared with patient-matched normal mucosa. WISP-3 

mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 

In contrast, W1SP-2 mapped to human chromosome 20ql2- 

20ql3 and its DNA was amplified, but RNA expression was 

reduced (2- to >30-fold) in 79% of the tumors. These results 

suggest that the WISP genes may be downstream of Wnt-1 

signaling and that aberrant levels of WISP expression in colon 

cancer may play a role in colon tumorigenesis. 

Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsn) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-30 (GSK-30) resulting in an increase in 
fi-catenin levels. Stabilized /3-catenin interacts with the tran- 
scription factor TCF/Lef 1, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
/3-catenin levels (9). APC is phosphorylated by GSK-30, binds 
to 0-catenin, and facilitates its degradation. Mutations in * 
either APC or /3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-j3 superfamily, and the 
homeobox genes, engrailed \goosecoid % twin (Xtwn\ and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtr active hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISP-3. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
rtov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 
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VWC, von Willebrand factor type C module. 
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cDNA was synthesized froni 2 jig of poly(A)* RN A isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 u,g 
of poly(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human W1SP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 u,M of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense nbo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) . for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centnfugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2<Act) w here ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
a-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The H//S/>-specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP -1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and £). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on 0-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/ Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-L The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of -40,000 (M r 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 2A). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of **27,000 {M x 21 K) (Fig. IB). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 
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Fig. 1. WISP- J and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 {A) and 
WISP-2 (B) expression in C57MG, C57MG/WnM, and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 jmg) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP-1- specific probe 
(amino acids 278-300) or a 190-bp WSW-specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human /3-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-I (A) and mouse and human WISP-2 (£). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-1 . 

Identification of WISPS. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
" human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 14). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-L WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (1GF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 2 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 2 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISPS was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 £-//). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. (A, C,£, and G) Representative hematoxyiin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately welNdifferentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power {A and 5), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (£ and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 
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the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that, the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors {P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 
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Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 jig) 
digested with EcoRl (WISP-1) or Xbal (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PGR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WlSP-3 RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 
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mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 

DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
WnM. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., j3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-1 -transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through 0-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or tiov, A recent report has shown that integrin a v ft serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control* the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
. tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-01, which is the stimulus for 
stromal proliferation (34). TGF-j31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-I 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISPS RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP--2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-I, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down -regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and /3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. . 
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methods. Peptides AENK or AEQK were dissolved in water, made isotonic with 
NaCl and diluted into RPM1 growth medium. T-ceU-proliferation assays were 
done essentially as described 2 " 1 . Briefly, after antigen pulsing t30u-gmr' 
TTCF) with tetrapeptides (l-2mgmr l ), PBMCs or EBV-B cells were 
washed in PBS and fixed,for 45 s in 0.05% glutaraldehyde. Glycine was added 
to a final concentration of 0.1M and the cells were washed five times in RPM1 
1640 medium containing 1% FCS before co-culture with T-cell clones in 
round-bottom 96-well microtitre plates. After 48 h, the cultures were pulsed 
with 1 uXi of 3 H-thymidine and harvested for scintillation counting 1 6 h later. 
Predigestion of native TTCF was done by incubating 200 u,g TTCF with 0.25 u,g 
pig kidney legumain in 500 uJ 50 mM citrate buffer, pH 5.5, for 1 h at 37 °C 
Glycopeptide digestions. The peptides HIDNEEDI, HIDN(N-glucosamine) 
EEDI and H1DNESD1, which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic (Promega) digestion 
of 5mg reduced, carboxy- methylated human transferrin followed by 
concanavalin A chromatography". Glycopeptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-terminal sequencing. The lyophilized transferrin- 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol. Digestions were performed for 3 h at 30 °C with 
5-50 mU ml" 1 pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of lOmgrnl" a- 
cyanocinnamic acid in 50% acetonitrile/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal standar- 
dization was obtained with a matrix ion of 568.13 mass units. 
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Fas ligand (FasL) is produced by 'activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apol/CD95 (ref. 1). 
One important role of FasL and Fas is to mediate inunune- 
cytotoxic killing of cells that are potentially harmful to the 
organism, such as virus-infected or tumour cells 1 . Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumour!; studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL. 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 3 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG)\ 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy-terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected with individual TNF- 
family , ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL 4 (Fig. 2a), but not to cells transfected with 
TNF 5 , Apo2L/TRAIL 6,7 , Apo3L/TWEAK s ' 9 , or OPGL/TRANCE/ 
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RANKL 10 " 12 (data not shown). DcR3-Fc immunoprecipitated shed 
FasL from FasL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNFR1. Gel- filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (K d = 0.8 ± 0.2 and 
l.l±0.1nM, respectively; Fig. 2e), and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a). DcR3- 
Fc and Fas-Fc blocked soluble-FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
—0.1 jJLgml" 1 . Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (A1CD) of mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results 13 , activation 
of interleukin-2-stimulated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 
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induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes 1,14 " 1 *. Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from —65% to 
—30%, with half-maximal inhibition at — lu.gml" 1 ; the residual 
killing was probably mediated by the perforin/ grahzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and Fas-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL' 7 . 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 



Figure 1 Primary structure and expression of human DcR3. a, Alignment of the 
amino-acid sequences of DcR3 and of osteoprotegerin (OPG); the C-termtnal 101 
residues of OPG are not shown. The putative signai cleavage site (arrow), the 
cysteine-rich domains (CRD 1-4), and the A/-linked glycosyiation site (asterisk) are 
shown, b, Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A)* RNA (Clontech) from 
human fetal and adutt tissues or cancer cell lines. PBL. peripheral blood 
lymphocyte. 
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Figure 2 Interaction of DcR3 with FasL. a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (solid line, shaded area), TNFR1-Fc (dotted line) or buffer control 
{dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference {P < 0.001 ) between the 
binding of DcR3-Fc to cells transfected with FasL or pRK5. PE. phycoerythrin- 
tabelled cells, b, 293 cells were transfected as in a and metabolicatly labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFR1, DcR3 or Fas. 
c, Purified soluble FasL (sFasL) was immunoprecipitated withTNFRl-Fc, DcR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d, Flag-tagged sFasL was 
incubated with DcR3-Fc or with buffer and resolved by gel filtration: column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag. e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Flag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PCR) 18 in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3-based PCR primers and probes; we observed nearly the 
same amplification (data not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 



we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM218xe7 (T160), which maps to 
chromosome position 20ql3. Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic clone that 
carries DcR3, and sequenced the ends of the clone's insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3 -linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre' of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNF- 
ligand- family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG 2 ' 19 . . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface 17 . DcR3 competes with Fas for 
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Figure 3 Inhibition of FasL activity by DcR3. a. Human Jurkat T leukaemia ceils 
were incubated with Flag-tagged soluble FasL (sFasL;.5ngmr l ) oligomerized 
with anti-Flag antibody (0.1 ^gmr 1 } in the presence of the proposed inhibitors 
DcR3-Fc. Fas-Fc or human IgGl arid assayed for apoptosis (mean - s.e.m. of 
triplicates), b, Jurkat cells were incubated with sFasL-Flag.plus anti-Flag antibody 
as in a. in presence of 1 ug ml"' DcR3-Fc (rilled circles). Fas-Fc (open circles) or 
human IgGl (triangles), and apoptosis was determined at the indicated time 
points, c, Peripheral blood T cells were stimulated with PHA and interleukin-2. 
followed by control (white bars) or anti-CD3 antibody (filled bars), together with 
phosphate-buffered saline (PBS), human IgGl Fas-Fc. or DcR3-Fc (I0u.gmr'). 
After 16 h, apoptosis of CD4* cells was determined (mean ± s.e.m. of results from 
five donors), d. Peripheral blood natural killer celts were incubated with 5, Cr- 
labelled Jurkat cells in the presence of DcR3-Fc (rilled circles). Fas-Fc (open 
circles) or human IgGl (triangles), and target-celt death was determined by 
release of 5, Cr (mean ± s.d. for two donors, each in triplicate). 
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Figure 4 Genomic amplification of DcR3 in tumours, a, Lung cancers, comprising 
eight adenocarcinomas {c, d, f. g. h, j, k, r), seven squamous-cell carcinomas (a, e, 
m, n. o, p. q). one non-small-cell carcinoma (b). one small-cell carcinoma (i), and 
one bronchial adenocarcinoma (I). The data are means ± s.d. of 2 experiments 
done in duplicate, b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means ± s.e.m. of five experiments done in duplicate, c. In situ hybridization , 
analysis of DcR3 mRNA expression in a squamous-cell carcinoma of the lung. A 
representative bright-held image (left) and the corresponding dark-field image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S). blood vessel (V) and necrotic tumour tissue 
(N) are also shown, d, Average amplification of OcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward, Rev and Fwd), the 
DcR3-linked marker T160. and other chromosome-20 markers, in the nine colon 
tumours showing DcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P <0.01 for a Student's f-test 
comparing each marker with DcR3. 
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FasL binding; hence, it may ^present a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin-1 has been 
described 21 . In addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL-related apoptosis- 
inducing molecule Apo2L 22 . Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR-family member is 
OPG\ which shares greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L 19 . Thus, DcR3 and OPG define a new subset of 
TNFR-family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. O 



Methods 

Isolation of DcR3 cDNA. Several overlapping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyte 
Pharmaceuticals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-generated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the coding regions of these clones were identical in size to that of 
the initial clone (data not shown). 

Fc-fusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas or TNFR1, was fused to the hinge and Fc region of human 
IgGl. expressed in insect SF9 cells or in human 293 cells, and purified as 
described". 

Fluorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using calcium phosphate or Effectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human FasL 4 (2 u.g), together with pRK5 encoding CrmA 
(2u.g) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-Fc or TNFRl-Fc and then with phycoerythrin-conjugated 
streptavidin ( GibcoBRL), and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected cells by DcR3-Fc; as these cells express little FasL (data 
not shown), it is possible that DcR3 recognized some other factor that is 
expressed constitutively on 293 cells. 

Immunoprecipitation. Human 293 cells were transfected as above, and 
metabolically labelled with [ 35 S]cysteine and [ 35 S] methionine (0.5 mCi; 
Amersham). After 16h of culture in the presence of z-VAD-fmk (10u.M), 
the medium was immunoprecipitated with DcR3-Fc, Fas-Fc or TNFRl-Fc 
(5 u-g), followed by protein A-Sepharose (Repligen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified, Flag-tagged soluble FasL (1 u.g) (Alexis) was incubated 
with each Fc-fusion protein (1 u-g). precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25u.g) was 
incubated with bufTer or with DcR3-Fc (40 u.g) for U5 h at 24 °C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6-ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 100 u.1 aliquots into microtitre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc, 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin-horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homotrimers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 



IgG, blocked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
above, and the wells were blocked with excess IgGl before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AICD. CD3 + lymphocytes were isolated from peripheral blood of 
individual donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 u-g ml" 1 ) for 24 h, and cultured 
in the presence of interleukin-2 (100 U ml" 1 ) for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later.by FACS analysis of annexin-V-binding of CD4* cells". 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltenyi 
Biotech), and incubated for 16 h with 3, Cr-Ioaded Jurkat cells at an effector- 
to-target ratio of 1:1 in the presence of.DcR3-Fc, Fas-Fc or human IgGl. 
Target-cell death was determined by release of *'Cr in effector- target co- 
cultures relative to release of 5, Cr by detergent lysis of equal numbers of Jurkat 
cells. 

Gene-amplification analysis. Surgical specimens were provided by J. Kern 
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the concentration was determined using Hoechst dye 33258 
intercalation fluorometry. Amplification was determined by quantitative PCR 11 
using a TaqMan instrument ( AB1). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER- 2 oncogenes (data 
not shown). Gene-specific primers and fluorogenic probes were designed on 
the basis of the sequence of DcRJ or of nearby regions identified on a BAC 
carrying the human DcR3 gene; alternatively, primers and probes were based 
on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest 
available marker which maps to -500 kilobases from T160, and five extra 
markers that span chromosome 20. The DcR3 -specific primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the 
fluorogenic probe sequence was 5'-(FAM-ACACGATGCGTGCTCCAAGCAG 
AAp-(TAMARA), where FAM is 5' -fluorescein phosphoramidite. Relative 
gene-copy numbers were derived using the formula 2 (iCT) , where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotes 1 . The recently completed Escherichia coli genome 
sequence revealed that the largest family of paralogous £. coli 
proteins is composed of ABC transporters 2 . Many eukaryotic 
proteins of medical significance belong to this family, such as 
the cystic fibrosis transmembrane conductance regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter associated with antigen processing 
(Tapl -Tap2). Here we report the crystal structure at 1.5 A resolu- 
tion of HisP, the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from Salmonella typhimurium. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains 1 . In 
prokaryotes these domains are often separate subunits which are 
assembled into a membrane-bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain. The 
periplasmic histidine permease of S. typhimurium and E. coli u ~* is a 
well-characterized ABC transporter that is a good model for this 
superfamily It consists of a membrane-bound complex, HisQMP 2 , 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins 9 , is accessible from both sides of the membrane, 
presumably by its interaction with HisQ and HisM*. The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
activity 8 , and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 . HisP has been purified 
and characterized in an active soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex 8 . 

The overall shape of the crystal structure of the HisP monomer is 
that of an l U with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig. 1). A six-stranded (3- 
sheet ((33 and 08-012) spans both arms of the L, with a domain of a 
a- plus 0-type structure (01, 02, 04-07, ctl and ct2) on one side 
(within arm I) and a domain of mostly a-helices (a3-a9) on the 
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Figure 1 Crystal structure of HisP. a, View of the dimer along an axis 
perpendicular to its two-fold axis. The top and bottom of the dimer are suggested 
to face towards the periplasmic and cytoplasmic sides, respectively (see text). 
The thickness of arm II is about 25 A, comparable to that of membrane. a-Helices 
are shown in orange and p-sheets in green, b, View along the two-fold axis of the 
HisP dimer, showing the relative displacement of the monomers not apparent in 
a. The (J-strands at the dimer interface are labelled, c, View of one monomer from 
the bottom of arm I, as shown in a, towards arm II, showing the ATP-binding 
pocket, a-c, The protein and the bound ATP are in 'ribbon' and 'ball-and-stick' 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOLSCRIPT 29 . N, amino terminus; C, C 
terminus. 
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Gene amplification is a common event in the progression of 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase-chain-reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor DNA. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase, rather than the amount of PCR 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndl and 
erbB2) in breast tumors. Extra copies of myc, ccndl and erbB2 
were observed in 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18.6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique will make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings. Int. J. 
Cancer 78:661 -666, 1998. 
© J 998 Wiley-Liss.' Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosomal double minutes 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kallioniemi et ai, 1994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include myc (8q24), 
ccnd\ ( 1 1 q 1 3), and erbBl ( 1 7q 1 2-q2 1 ) (for review, see Bieche and 
Lidereau, 1995). 

Amplification of the myc, ccndl, and erbBl proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Berns et ai, 1992; 
Schuuring et al., 1992; 9iamon et ai. 1987). Muss et ai (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance, 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon et al (1987) between 
erbBl amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in rumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
ug/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard curve is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product. Interest in the kinetic method has been stimulated by a 
novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et al., 1996; Heid et 
ai, 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et ai (1991). The latter uses the 5' 
nuclease activity of Taq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The 
approach uses dual-labeled fluorogenic hybridization probes (Lee 
et ai, 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (i.e., 6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA (i.e., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydrolyzed by the 
5'-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye (i.e.. ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et al, 1994): (i) the 
probe-based homogeneous assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (ii) the Q (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why C, is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (Hi) use of C, 
vaiues gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the. need for serial dilution; (iv) The real-time PCR 
method is run in a closed-rube system and requires no post-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96 wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available just one minute after thermal cycling is complete; (vii) the 
. sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 nr. 

Here, we applied this semi-automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
tumors (myc, ccndl and erb&2)< as well as 2 genes (alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 108 breast tumors 
were compared with previous Southern-blot data for the same 
samples. 



MATERIAL AND METHODS 



Tumor and blood samples 

Samples were obtained from 1 08 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular-weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter C, (threshold cycle) is 
defined as the fractional cycle number at which the fluorescence 
generated by cleavage of the probe passes a fixed threshold above 
baseline. The target gene copy number in unknown samples is 
quantified by measuring C, and by using a standard curve to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e.. lack 



of extensive degradation) are both difficult to assess. We therefore 
also quantified a control gene (alb) mapping to chromosome region 
4qll-ql3. in which no genetic alterations have been found in 
breast-rumor DNA by means of CGH (Kallioniemi et al, 1 994). 

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic. DNA. The ratio defining the level of amplification is 
termed *'N", and is determined as follows: 

copy number of target gene (app, myc, ccndl, erbB2) 

N = ■■ 

copy number of reference gene (alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA). 

Primers were purchased from DNAgency (Malvern, PA) and 
probes from Perkin-Elmer Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, MicroAmp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et al. (1993). In 
practice, each specific PCR product was obtained by amplifying 20 
rig of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real-time quantitative PCR. The 5 PCR products were purified 
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- 
den) electrophorezed through an acrylamide gel and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectrophotometrically and pooled, and serially 
diluted 10-fold in mouse genomic DNA (Clontech, Palo. Alto, CA) 
at a constant concentration of 2 ng/ul. The standard curve used for ■ 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from 10" 7 (10 5 copies of each gene) to 
10" 10 (10 2 copies). This series of diluted PCR products was 
aliquoted and stored aj -80°C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 hg). 

PCR amplification. Amplification mixes (50 ul) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
10X TaqMan buffer (5 ul), 200 uM dATP, dCTP, dGTP, and 400. 
uM dUTP, 5 mM MgCl 2 , 1.25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 nM probe. The thermal cycling conditions comprised 2 min at 
50°C and 1 0 min at 95°C. Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65°C for 1 min. Each assay included: a standard 
curve (from 10 5 to 10 2 copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96-well microplate). All 
samples with a coefficient of variation (CV) higher than 10% were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates C, and determines the starting copy number in the 
samples. 
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Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erb&l proto-oncogenes, 
and the P-amyloid precursor protein gene (app), which maps to a 
chromosome region (21q21.2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi et al, 1994). The 
reference disomic gene was the albumin gene (alb, chromosome 
4qll-ql3). 



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/ul. It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as 10 2 copies or as 
many as 1 0 5 copies. 

Copy-number ratio of the 2 reference genes (app and alb/ 

The app to alb copy-number ratio was determined in 1 8 normal 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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Figure 1 - Albumin (alb) gene dosage by real-time PCR. Top: Amplification plots for reactions with starting alb gene copy number ranging 
from 10 s (A9), 10 4 (A7), 10 3 (A4) to 10 2 (A2) and a no-template control (Al), Cycle number is plotted vs. change in normalized reporter signal 
(ARn), For each reaction tube, the fluorescence signal of the reporter dye (FAM) is divided by the fluorescence signal of the passive reference dye 
(ROX), to obtain a ratio defined as the normalized reporter signal (Rn). ARn represents the normalized reporter signal (Rn) minus the baseline 
signal established in the first 15 PCR cycles. ARn increases during PCR as alb PCR product copy number increases until the reaction reaches a 
plateau. C t (threshold cycle) represents the fractional cycle number at which a significant increase in Rn above a baseline signal (horizontal black 
line) can first be delected. Two replicate plots were performed for each standard sample, but the data for only one are shown here. Bottom: 
Standard curve plotting log starting copy number vs. C, (threshold cycle). The black dots represent the data for standard samples plotted in 
duplicate and the red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders of linear dynamic 
range. 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions (app, 2 1 q2 1 .2; alb, 4q 1 1 -q 1 3) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi et al, 1994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean 1.02 ± 0.21), and was similar for the 108 primary breast- 
tumor DNA samples (0.6 to 1.6, mean 1.06 ± 0.25), confirming 
that alb and app are appropriate reference disomic genes for 
breast-tumor DNA. The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl and erb£2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 1 8 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0.5 to 1.3 
(mean 0.84 ± 0.22) for mvc, 0.7 to 1.6 (mean 1.06 ± 0.23) for 
ccndl and 0.6 to 1.3 (mean 0.91 ± 0.19) for erbBl. SinceN values 
for myc, ccndl and erbBl in normal leukocyte DNA consistently 
fell between 0.5 and 1 .6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 

myc, ccndl and erbiJJ? gene dose in breast-tumor DNA 

myc, ccndl and erbBl gene copy numbers in the 108 primary 
breast tumors are reported in Table I. Extra copies of ccndl were 
more frequent (23%, 25/108) than extra copies of erbBl (15%, 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccndl, 1 to 15.1 for erbBl, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table II represent tumors in which the ccndl gene was 
amplified 16-fold (T145), 6-fold (T133) and non-amplified (Tl 18). 
The 3 genes were never found to be co-amplified in the same tumor. 
erbBl and ccndl were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc and erbBl in 1 case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% in the 
erbBl copy number (N < 0.5), suggesting that they bore deletions 
of the 17q21 region (the site of erbBl). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southern-blot analysis of myc, ccndl and erbBl amplifications 
had previously been done on the same 1 08 primary breast tumors. A 
perfect correlation between the results of real -time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N > 5). However, there were cases (1 myc, 6 ccndl and 4 erbBl) 
in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, .standard- 
ized and rapid method were perfected. Gene amplification, status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE 1 - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc. 
ccndl AND erbhl GENES IN 108 HUMAN BREAST TUMORS 



Gene 




Amplification level (N) 




<0.5 


0.5-1.9 


2-4.9 




myc 

ccndl 

erbBl 


0 
0 

5(4.6%) 


97(89.8%) . 
83 (76.9%) 
87 (80.6%) 


11 (10.2%) 
17(15.7%) 
8 (7.4%) 


0 

8 (7.4%) 
8 (7.4%) 



reagents and requires relatively large amounts of high-quality 
genomic DNA. which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DNA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paraffin -embedded tissues). 

In this study, we validated a PCR method developed for the 
quantification of gene over-representation in rumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et al, 1 994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-gtycosylase (UNG) 
(Longo et ai, 1990). The second advantage is the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the number of 
copies of a target gene in more than 100 rumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin- fixed, paraffin-embedded tissues. Fourth, and above all, 
real-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables C t to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the C t value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
C t ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disavantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
ern blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained in 
-heterogeneous tissue specimens. Only FISH and immunohistochem- 
istry can measure alterations on a cell-by-cell basis (Pauletti et ai, 
1996; Slamon et al, 1989). However, FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature. (0 Chromosome regions 4qll-ql3 and 2 1 q21 .2 
(which bear alb and app, respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et al, 1994). (ii) We found that 
amplifications of these 3 oncogenes were independent events, as 
reported by other teams (Bems et al, 1 992; Borg et al, 1992). (Hi) 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and erbBl amplifica- 
tion, confirming the findings of Borg et al (1 992) and Courjal et al 
(1997). (iv) The maxima of ccndl and erbBl over-representation 
were 1 8-fold and 15-fold, also in keeping with earlier results (about 
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Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: Tl 1 8 (E 12, C6, black squares), T133 (Gil, B4, red squares) 
and T145 (A8, C8, blue squares). Given the C, of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experiment. Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table 11. 

30-fold maximum) (Berns e/ 1992; Borgetal, 1992; Courjal et et ai t 1996). Our results also correlate well with those recently 

al, 1997). (v) The erbB2 copy numbers obtained with real-time published by Gelmini et al. (1997), who used the TaqMan system to 

PCR were in good agreement with data obtained with other measure erb&2 amplification in a small series of breast tumors 

quantitative PCR-based assays in terms of the frequency and (n = 25), but with an instrument (LS-50B luminescence spectrom- 

degree of amplification (An cr/oA, 1995; Deng et al, 1996; Valeron eter, Perkin-Elmer Applied Biosystems) which only allows end- 
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TABLE II - EXAMPLES OF ccndl GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS' 



Tumor 




ccndl 






alb 




Uccndl/ulb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


Til 8 


4525 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








T133 


59821 






9787 






6.03 




61659 


61100 


1111 


10092 


10137 


375 




61821 






10533 








. T145 


128563 






732V 






16.34 




125892 


125392 


3448 


7762 


7672 


316 




121722 






7933 









'For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gene 
amplification (Hccndl/alb) is determined by dividing the average ccndl 
copy number value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(> 5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and erbB2) observed by means of real-time 
quantitative PCR as compared -with Southern-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et al, 1992; 
Slamoneva/., 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erbB2 (but not of the other 2 proto- 
oricogenes) in several tumors; erbB2 is located in a chromosome 
region (17q21) reported to contain both deletions and amplifica- 
tions in breast cancer (Bieche and Lidereau, 1 995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 
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Structure of Apo2L/TRAIL in a Complex with Death Receptor 5. Molec. Cell 4, 
563-571 (1999). 

: 55. Hymowitz, S.G., O'Connel, M.P.JJtsch, M.H., Hurst, A., Totpal, K., Ashkenazi, 
A„ de Vos, A.M., Kelley, R.F. A unique zinc-binding site revealed by a high- 
resolution X-ray structure of homotrimeric Apo2L/TRATJL. Biochemistry 39, 633- 
640(2000). 

56. Zhou, Q., Fukushima, P., DeGraff, W., Mitchell, J.B., Stetler-Stevenson, M., 
Ashkenazi.- A., and Steeg, P.S. Radiation and the Apo2L/TRAIL apoptotic 
pathway preferentially inhibit the colonization of premalignant human breast 
cancer cells overexpressing cyclin Dl. Cancer Res. 60, 261 1-2615 (2000). 

57. Kischkel, F.C., Lawrence, D. A., Chuntharapai, A., Schow, P., Kim, J., and 
Ashkenazi. A. Apo21/TRAIL-dependent recruitment of endogenous FADD and 
Caspase-8 to death receptors 4 and 5. Immunity 12, 61 1-620 (2000). 

58. Yan, M., Marsters, S.A., Grewal, I.S., Wang, H., *Ashkoiazi J _A., and *Dixit, 
V.M. Identification of a receptor for BlyS demonstrates a crucial role in humoral 
immunity. Nature Immunol. 1,37-41 (2000). 
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59. Marsters, S.A., Yan, M., Pitti, R.M., Haas, P.E., Dixit, V.M., and Ashkenazi, A. 
Interaction of the TNF homologues BLyS and APRIL with the TNF receptor 
homologues BCMA and TACI. Curr. Biol. 10, 785-788 (2000). 

60. Kischkel, F.C., and Ashkenazi, A . Combining enhanced metabolic labeling with 
immunoblotting to detect interactions of endogenous cellular proteins. 
Biotechniques 29, 506-512 (2000). 

.61 . Lawrence, D., Shahrokh, Z., Marsters, S., Achilles, K., Shih, D. Mounho, B., 

Hillan, K., Totpal, K. DeForge, L., Schow, P., Hooley, J., Sherwood, S., Pai, R., 
Leung, S., Khan, L., Gliniak, B., Bussiere, J., Smith, C, Strom, S., Kelley, S., 
Fox, J., Thomas, D., and Ashkenazi. A. Differential hepatocyte toxicity of 
recombinant Apo2L/TRAIL versions. Nature Med. 7, 383-385 (2001). 

62. Chuntharapai, A., Dodge, K., Grimmer, K., Schroeder, K., Martsters, S.A., 
Koeppen, H., Ashkenazi. A ., and Kim, K. J. Isotype-dependent inhibition of 
tumor growth in vivo by monoclonal antibodies to death receptor 4. J. Immunol. 
166,4891-4898(2001). 

63. Pollack, I.F., Erff, M., and Ashkenazj 3 _A. Direct stimulation of apoptotic 
signaling by soluble Apo2L/tumor necrosis factor-related apoptosis-inducing 
ligand leads to selective killing of glioma cells. Clin. Cancer Res. 7, 1362-1369 
(2001). 

64 Wang, H., Marsters, S. A., Baker, T., Chan, B., Lee, W.P., Fu, L., Tumas, D., Yan, 
M., Dixit, V.M., * Ashkenazi. A ., and *Grewal, I.S. TACI-ligand interactions are 
required for T cell activation and collagen-induced arthritis in mice. Nature 
Immunol. 2, 632-637 (2001). 

65. Kischkel, F.C., Lawrence, D. A., Tinel, A., Virmani, A., Schow, P., Gazdar, A., 
Blenis, J., Arnott, D., and Ashkenazi. A . Death receptor recruitment of 
endogenous caspase-10 and apoptosis initiation in the absence of caspase-8. J. 
Biol. Chem. 276, 46639-46646 (2001). 

66. LeBlanc, H., Lawrence, D.A., Varfolomeev, E., Totpal, K., Morlan, J ., Schow, P., 
Fong, S., Schwall, R., Sinicropi, D., and Ashkenazi. A Tumor cell resistance to 
death receptor induced apoptosis through mutational inactivation of the 
proapoptotitc Bcl-2 homolog Bax. Nature Med. 8, 274-281 (2002). 

67. Miller, K., Meng, G., Liu, J., Hurst, A., Hsei, V., Wong, W-L., Ekert, R., 
Lawrence, D., Sherwood, S., DeForge, L., Gaudreault., Keller, G., Sliwkowski, 
M.. Ashkenazi. A ., and Presta, L. Design, Construction, and analyses of 
multivalent antibodies. J. Immunol. 170, 4854-4861 (2003). 



8 



68. Varfolomeev, E., Kischkel, F., Martin, F., Wanh, H., Lawrence, D., Olsson, C, 
Tom, L., Erickson, S., French, D., Schow, P., Grewal, I. and Ashkenazi, A. 
Immune system development in APRIL knockout mice. Submitted. 

Review articles: 

1 . Ashkenazi, A., Peralta, E., Winslow, J., Ramachandran, J., and Capon, D., J. 
Functional role of muscarinic acetylcholine receptor subtype diversity. Cold 
Spring Harbor Symposium on Quantitative Biology, LIII, 263-272 (1988). 

2. AshkenazLA., Peralta, E., Winslow, J., Ramachandran, J., and Capon, D. 
Functional diversity of muscarinic receptor subtypes in cellular signal 
transduction and growth. Trends Pharmacol Scu Dec Supplement, 12-21 (1989). 

3. Chamow, S., Duliege, A., Ammann, A., Kahn, J., Allen, D., Eichberg, J., Byrn, 
R., Capon, D., Ward, R., and Ashkenazi A . CD4 immunoadhesins in anti-HIV 
therapy: new developments. Int. 1 Cancer Supplement 7, 69-72 (1992). 

4. AshkenazLA., Capon, and D. Ward, R. Immunoadhesins. Int. Rev. Immunol. 10, 
217-225 (1993). 

5. Ashkenazi A ., and Peralta, E. Muscarinic Receptors. In Handbook of Receptors 
and Channels. (S. Peroutka, ed.), CRC Press, Boca Raton, Vol. I, p. 1-27, (1994). 

6. Krantz, S. B., Means, R. T., Jr., Lina, J., Marsters, S. A., and Ashkenazi, A. * 
Inhibition of erythroid colony formation in vitro by gamma interferon. In 
Molecular Biology of Hematopoiesis (N. Abraham, R. Shadduck, A. Levine F. 
Takaku, eds.) Intercept Ltd. Paris, Vol. 3, p. 135-147 (1994). 

7. Ashkenazi, A . Cytokine neutralization as a potential therapeutic approach for 
SIRS and shock. J. Biotechnology in Healthcare 1, 197-206 (1994). 

8. Ashkenazi, A ., and Chamow, S. M. Immunoadhesins: an alternative to human 
monoclonal antibodies. Ifnmunomethods: A companion to Methods in 
Enzimology 8, 104-115 (1995). 

9. Chamow, S., and Ashkenazi, A . Immunoadhesins: Principles and Applications. 
Trends Biotech. 14, 52-60 (1996). 

10. Ashkenazi; A ., and Chamow, S. M. Immunoadhesins as research tools and 
therapeutic agents. Curr. Opin. Immunol. 9, 195-200 (1997). 

11. Ashkenazi, A ., and Dixit, V. Death receptors: signaling and modulation. Science 
281,1305-1308 (1998). 

12. Ashkenazi, A ., and Dixit, V. Apoptosis control by death and decoy receptors. 
Curr. Opin. Cell Biol. 11, 255-260 (1999). 
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13. Ashkenazi. A . Chapters on Apo2IVTRAIL; DR4, DR5, DcRl, DcR2; and DcR3. 
Online ^yt"V ; "" MsnHhnnV (www a pnet.com/cvtokinereference /). 

14. Ashkenazi. A . Targeting death and decoy receptors of the tumor necrosis factor 
superfamily. Nature Rev. Cancer 2, 420-430 (2002). 

15. LeBlanc, H. and Ashkenazi. A . Apoptosis signaling by Apo2L/TRAIL. Cell Death 
and Differentiation 10, 66-75 (2003). 

1 6. Almasan, A. and Ashkenazi. A . Apo2L/TRAIL: apoptosis signaling, biology, and 
potential for cancer therapy. Cytokine arid Growth Factor Reviews 14, 337-348 
(2003). 



Book: 

Antibody Fusion Proteins (Chamow, S., and Ashkenazi, A., eds., John Wiley and 
Sons Inc.) (1999). 1 



Talks: 

1 . Resistance of primary HIV isolates to CD4 is independent of CD4-gpl20 binding 
affinity. UCSD Symposium, HTV Disease: Pathogenesis and Therapy. 
Greenelefe, FL, March 1991. 

2. Use of immuno-hybrids to extend the half-life of receptors. IBC conference on 
Biopharmaceutical Halflife Extension. New Orleans, LA, June 1992. 

3. Results with TNF receptor Imrnunoadhesins for the Treatment of Sepsis. IBC 
conference on Endotoxemia and Sepsis. Philadelphia, PA, June 1992. 

4. Immunoadhesins: an alternative to human antibodies. IBC conference on 
Antibody Engineering. San Diego, C A, December 1993. 

5. Tumor necrosis factor receptor: a potential therapeutic for human septic shock. 
American Society for Microbiology Meeting, Atlanta, GA, May 1993. 

6. Protective efficiacy of TNF receptor imriiunoadhesin vs ariti-TNF monoclonal 
antibody in a rat model for endotoxic shock. 5th International Congress on TNF. 
Asilomar, CA,May 1994. 

7. Interferon-y signals via a multisubunit receptor complex that contains two types of 
polypeptide chain. American Association of Immunologists Conference. San 
Franciso, CA, July 1995. 

8. Immunoadhesins: Principles and Applications. Gordon Research Conference on 
Drug Delivery in Biology and Medicine. Ventura, CA, February 1996. 
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9. Apo-2 Ligand, a new member of the TNF family that induces apoptosis in tumor 
cells. Cambridge Symposium on TNF and Related Cytokines in Treatment of 
Cancer. Hilton-Head, NC, March 1996. 

10. Induction of apoptosis by Apo2 Ligand. American Society for Biochemistry and 
Molecular Biology, Symposium on Growth Factors and Cytokine Receptors. New 

Orleans, LA, June, 1996. 

11. Apo2 ligand, an extracellular trigger of apoptosis. 2nd Clontech Symposium, 

Palo Alto, CA, October 1996. 

12. Regulation of apoptosis by members of the TNF ligand and receptor families. 
Stanford University School of Medicine, Palo Alto, CA, December 1996. 

13. Apo-3: ahovel receptor that regulates cell death and inflammation. 4th 
International Congress on Immune Consequences of Trauma, Shock, and Sepsis. 
Munich, Germany, March 1997. 

14. New members of the TNF ligand and receptor families that regulate apoptosis, 
inflammation, and immunity. UCLA School of Medicine, LA, CA, March 1997. 

15. Immunoadhesins: an alternative to monoclonal antibodies. 5th World Conference 
on Bispecific Antibodies. Volendam, Holland, June 1997. 

16. Control of Apo2L signaling. Cold Spring Harbor Laboratory Symposium on 
Programmed Cell Death. Cold Spring Harbor, New York. September, 1997. 

17. Chairman and speaker, Apoptosis Signaling session. BC's 4th Annual 
Conference on Apoptosis. San Diego, CA., October 1997. 

18. Control of Apo2L signaling by death and decoy receptors. American Association 
for the Advancement of Science. Philadelphia, PA February 1998. 

19. Apo2 ligand and its receptors. American Society of Immunologists. San 
Francisco, CA, April 1998. 

20. Death receptors and ligands. 7th International TNF Congress. Cape Cod, MA, 

May 1998. 

21. Apo2L as a potential therapeutic for cancer. UCLA School of Medicine. LA, 
. CA, June 1998. 

.22. Apo2L as a potential therapeutic for cancer. Gordon Research Conference on . 

Cancer Chemotherapy. New London, NH, July 1998. 
23: Control of apoptosis by Apo2L. Endocrine Society Conference, Stevenson, WA, 

August 1998. 

24. Control of apoptosis by Apo2L. International Cytokine Society Conference, 
Jerusalem, Israel, October 1998. 
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25 . Apoptosis control by death and decoy receptors. American Association for 
Cancer Research Conference, Whistler, BC, Canada, March 1 999. 

26. Apoptosis control by death and decoy receptors. American Society for 
Biochemistry and Molecular Biology Conference, San Francisco, CA, May 1999. 

27. Apoptosis control by death and decoy receptors. Gordon Research Conference on 
Apoptosis, New London, NH, June 1999. 

28. Apoptosis control by death and decoy receptors. Arthritis Foundation Research 

Conference, Alexandria GA, Aug 1999. 

29. Safety and anti-tumor activity of recombinant soluble Apo2L/TRAIL. Cold 

Spring Harbor Laboratory Symposium on Programmed Cell Death. . Cold Spring 
Harbor, NY, September 1999. 

30. The Apo2L/TRAIL system: therapeutic potential. American Association for 
Cancer Research, Lake Tahoe, NV, Feb 2000. 

31. Apoptosis and cancer therapy. Stanford University School of Medicine, Stanford, 
CA, Mar 2000. 

32. Apoptosis and cancer therapy. University of Pennsylvania School of Medicine, 
Philladelphia, PA, Apr 2000. 

33. Apoptosis signaling by Apo2L/TRAJL. International Congress on TNF. 
Trondheim, Norway, May 2000. 

34. The Apo2I7TRABL system: therapeutic potential. Cap-CURE summit meeting. 

Santa Monica, CA, June 2000. 

35. The Apo2L/TRAIL system: therapeutic potential. MD Anderson Cancer Center. 
Houston, TX, June 2000. 

36. Apoptosis signaling by Apo2L/TRAIL. The Protein Society, 14 th Symposium. 
San Diego, CA, August 2000. 

37. Anti-tumor activity of Apo2L/TRAIL. AAPS annual meeting. Indianapolis, IN 

Aug 2000. 

38. Apoptosis signaling and anti-cancer potential of Apo2I7TRAIL. Cancer Research 
Institute, UC San Francisco, CA, September 2000. 

39. Apoptosis signaling by Apo2L/TRAIL. Kenote address, TNF family 
Minisymposium, NTH. Bethesda, MD, September 2000. 

40. Death receptors: signaling and modulation. Keystone symposium on the 
Molecular basis of cancer. Taos, NM, Jan 2001. . 

41. Preclinical studies of Apo2L/TRAIL in cancer. Symposium on Targeted therapies 
in the treatment of lung cancer. Aspen, CO, Jan 2001. 
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42. Apoptosis signaling by Apo2L/TRAIL. Wiezmann Institute of Science, Rehovot, 
Israel, March 2001. 

43 . Apo2I7TRAIL: Apoptosis signaling and potential for cancer therapy. Weizmann 
Institute of Science, Rehovot, Israel, March 2001. 

44. Targeting death receptors in cancer with Apo2L/TRAIL. Cell Death and Disease 
conference, North Falmouth, MA, Jun 2001. 

45. Targeting death receptors in cancer with Apo2L/TRAIL. Biotechnology 
Organization conference, San Diego, CA, Jun 2001. 

46. Apo2L/TRAJL signaling and apoptosis resistance mechanisms. Gordon Research 
Conference on Apoptosis, Oxford, UK, July 200 1 . 

47. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Cleveland Clinic 
Foundation, Cleveland, OH, Oct 2001. 

48. Apoptosis signaling by death receptors: overview. International Society for 
Interferon and Cytokine Research conference, Cleveland, OH, Oct 2001. 

49. • Apoptosis signaling by death receptors. American Society of Nephrology 

Conference. San Francisco, CA, Oct 2001 . 

50. Targeting death receptors in cancer. Apoptosis: commercial opportunities. San 
Diego, CA, Apr 2002. 

51. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Kirnrnel Cancer 
Research Center, Johns Hopkins University, Baltimore MD. May 2002. 

52. Apoptosis control by Apo2L/TRAJL. (Keynote Address) University of Alabama 
Cancer Center Retreat, Birmingham, Ab. October 2002. 

53. Apoptosis signaling by Apo2L/TRAJJL. (Session co-chair) TNF international, 
conference. San Diego, CA. October 2002. 

54. Apoptosis signaling by Apo2L/TRAJJL. Swiss Institute for Cancer Research 
(ISREC). Lausanne, Swizerland. Jan 2003. 

55. Apoptosis induction with Apo2L/TRAJX. Conference on New Targets and 
Innovative Strategies in Cancer Treatment. Monte Carlo. February 2003. 

56. Apoptosis signaling by Apo2I7TRAIL. Hermelin Brain Tumor Center 
Symposium on Apoptosis. Detroit, MI. April 2003. 

57. Targeting apoptosis through death receptors. Sixth Annual Conference on 
Targeted Therapies in the Treatment of Breast Cancer. Kona, Hawaii. July 2003. 

58. Targeting apoptosis through death receptors. Second International Conference on 
Targeted Cancer Therapy. Washington, DC. Aug 2003. 
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1 . Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,329,028 (Jul 12, 1994). 

2. Ashkenazi, A., Chamow, S. and Kogan, T, Carbohydrate-directed crosslinking 
reagents. US patent 5,605,791 (Feb 25, 1997). 

3. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,889,155 (Jul 27, 1999). 

4; Ashkenazi, A., APO-2 Ligand. US patent 6,030,945 (Feb 29, 2000). 

5. Ashkenazi, A., Chuntharapai, A., Kim, J., APO-2 ligand antibodies.. US patent 6, 
046, 048 (Apr 4, 2000). 

6. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 6,124,435 (Sep 26, 2000). 

7. Ashkenazi, A., Chuntharapai, A., Kim, J., Method for making monoclonal and cross- 
reactive antibodies. US patent 6,252,050 (Jun 26, 2001). 

8. Ashkenazi, A. APO-2 Receptor. US patent 6,342,369 (Jan 29, 2002). 

9. Ashkenazi, A. Fong, S., Goddard, A., Gurney, A., Napier, M., Tumas, D., Wood, W. 
A-33 polypeptides. US patent 6,410,708 (Jun 25, 2002). 

10. Ashkenazi, A. APO-3 Receptor. US patent 6,462,176 Bl (Oct 8, 2002). 

1 1. Ashkenazi, A. APO-2LI and APO-3 polypeptide antibodies. US patent 6,469,144 Bl 
(Oct 22, 2002). 

12. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 6,582,928B1 (Jun 24, 2003). 
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High-throughput technologies, such as proteomic screening and DMA micrc-arrays. produce vast 
amounts of data requinng comprehensive analytical methods to decipher the biofoglcally return 
resutts. One approach would be to manually search the biomedica. literature; howeve^ thi vSa Z 
an arduous task. We developed an automated Hterature-mlrdng tool, termed MedGene^ whl* 
comprehensively summarizes and estimates the relative strength, of all human geS-d^se 
retetionshlps .n Medline. Using MedGene. we analyzed a novel micro-array exprei ton cS 
comparing breast cancer and normal breast tissue in the context of existing knowledge. wTfound^ 
correlation between the strength of the literature association and the magnitude of diffwe^ Z • 
expression level when considering change, as high as 5-fold; however. a*signincant mSS^l 

nteresungly. th, only held true for esuogen receptor (ER) positive tumors, not ER negaUve. iL^e 

simitr^ 
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Introduction 

At Its current pace, the accumulation of biomedical literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own Immediate fields, let alone cover a broader 
range of topics. For example, to follow a single disease, e.g. f 
breast cancer, a researcher would have had to scan 130 different 
Journals and read 27 papers per day In 1999.' This problem Is 
accentuated with high- throughput technologies such as DNA 
micro-arrays and proteomlcs. which require the analysis of 
large datasets Involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any mlcroarray experi- 
ment, thousands of genes may demonstrate statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the study, The ability to Interpret these datasets 
would be enhanced If they could be compared to a compre- 
hensive summary of what is known about all genes. Thus, there 
Is a need to summarize existing knowledge In a format that 
allows for the rapid analysis or associations between genes and 
diseases or other specific biological concepts. 

One solution to this problem Is to compile structured digital 
resources, such as the Breast Cancer Gene Database* and the 
Tumor Cene Database. 1 However, as these resources are hand- 
curated, the labor-Intensive review process becomes a rate- 
limiting step In the growth of the database. As a result, these 

•TowhomeomHpondiro Jbhaw^hn^hajvanledj. 
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databases have a limited scale and the genes are not selected 
In a systematic fashion. 

An alternative approach Is automated text mining; a method 
which Involves automated Information extraction by searching 
documents for text strings and analyzing their frequency and 
context This approach has been used successfully In several 
Instances for biological applications. In most cases, it has been 
applied to extract Information about the relationships or 
nteractlons that proteins or genes have with one another, in 
the literature or by functional annotation.*-' Thus far few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez-Iratxeta et al 
automatical* examined the CO (Cene Ontology) annotation 
of genes and their predicted chromosomal locations in order 
to Identify genes linked to Inherited disorders.* 

To obtain a mora global understanding of disease develop- 
ment. It would be valuable to Incorporate Information regarding 
all possible gene-dlseas* relationships. Including biochemical 
physiological pharmacological, epidemiological, as well as 
genetic This Information would enable comprehensive com- 
parisons between large experimental datasets and existing 
knowledge In the literature. This would accomplish two things 
First. It would serve to validate experiments by demonstrating 

J^TT."? 1,011585 ^ M P"* 1 "**. Second. It would 
rapidly highlight which genes are corroborated by the literature 
and which genes are novel In a given context. We have utilized 
a computational approach to literature mining to produce a 
JcvmaJ or PtoUon* flesaanch 200X I 405-412 409 
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comprehensive set or gene-disease relationships. In addition, 
we have developed a novel approach to assess the strength or 
each association based on the frequency of citation and co- 
cliattoa We applied this tool to help Interpret the data from a 
large micro-array gene expression experiment comparing 
normal and cancerous breast tissue. 



Methods 

MedGene Database. MedCene Is a relational database, stor- 
ing disease and gene Information from NCBI, text mining re- 
sults, statistical scores, and hyperlinks to the primary lit- 
erature. MedGene has a web-based user-Interface for users to 
query the database (hDy//hIpseq.med.harvard^ii/MedCene/). 

Text Mining Algorithms. MeSH files were downloaded from 
the McSH web site at NLM (Nation library of Medicine) (http-7/ 
www.nlm.nlh.gov/mesh/meshhomehtmO and human disease 
categories were selected. LocusLlnk files were downloaded from 
the LocusLlnk web site at NCBI (http://www.ncbLnlh.gov/ 
LocusLlnk/). Official/preferred gene symbol, official/preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and URLs for each LocusLlnk record, were 
collected. Gene search terms were used for literature searching 
and Included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gene 
family terms and gene oQlclal/prererred symbols, were used 
to Index Medline records. If the ofnclal/prererred gene symbols 
did not meet the standards to be an Index, then qualified gene 
official/preferred names were used. A local copy of Medline 
records (up to July. 2002) was pre-selected. 

A JAVA module examined the MeSH terms and then Indexed 
each Medline record with the appropriate disease terms, A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to Index the gene- 
related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were Indexed for both gene and disease 
(double positive hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency table-framework, we applied 
different statistical methods to estimate the strength of gene- 
disease relationships and evaluated the results. These methods 
Included chl-square analysis. Fisher's exact probabilities, rela- 
tive risk of gene, and relative risk of disease 19 (httpV/ 
hlpseq.med,harvard.edu/MedCene/). In addition, we computed 
the "product or frequency*, which Is the product of the 
proportion or disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution we transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPJ? to validate MedGene and 
to use for the analysis with the micro-array data. Spearman 
rank-correlation coefficients were used to assess the linear 
relationship between LPF and mlcroarray fold chance In 
expression level. 

Global Analysis. Diseases with at least 50 related genes were 
selected for clustering analysis, and the LPF scores were 
normalized with total score for each disease. Hierarchical 
clustering was done with the "Cluster" software and the 
clustering result was visualized using TreeVlewer" (http:// 
rana.lbI.gov/ElsenSoftware.htm). 
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Breast Tissue Micro-Anrays, Eighty-nine breast cancer 
samples (7996 ER-posltlve) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of histo- 
logical types, grades, and hormone receptor Immuno»phent> 
types of breast cancer. Blotinylated cRNA. generated from the 
total RNA extracted from the bulk tumor, was hybridized to 
Aflymetrix U95A ollgo- nucleotide micro-arrays. These micro- 
arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from Aflymetrix. and then further analyzed using 
the DNA-Chlp Analyzer (dChip) custom software. 

Results — 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations in the literature, 
we first complied complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) Index 
of Medline records was utilized. MeSH Is a controlled medical 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged in both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance.™ Twenty-three human disease category 
headings along with all of their child terms (seethe Supporting 
Information. Supplemental Table 1 # or visit httpi/Zhlpseq 
med.harvard.edu/MedCene/publlcaUon/s Table I.htmDwere 
selected from the 2002 MeSH index creating a list of 4033 
human diseases. 

No Index comparable to the MeSH Index exists for genes 
and thus. It was necessary to apply a string search algorithm 
for gene names or symbols found in Medline text. A complete 
list of genes, gene names, gene symbols, and frequently used 

252T?^ Were collectcd from the LocusLlnk database at 
NCBI. 11 * 11 which contains 53 259 Independent records keyed 
by an official gene symbol or name (June 18* 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use or Italics. If at all 
For the Intended use of this study, this lack of distinction Is' 
unlikely to have a large effect and may In fact be benendal. 
Initial attempts to search the literature using these lists 

^m^T^ 1 S0Urces of fa,se P 051 ^* *"d false negatives 
(Table 1). Fabe positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development or niters 
to reduce these errors. The syntax issues were readily handled 
by Including alternate syntax forms In the search terms. The 
Take positive cases, caused by duplicative and unrelated 
meanings for the terms, were more difficult to manage. Where 
possible, case sensitive string mapping reduced Inappropriate 
citations. In many cases, however, this was not sufficient and 
the terms had to be eliminated entirety, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For the purposes or data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene. Medline records were Indexed with a primary gene key 
when any synonym for that key was found In the title or 
abstract. Case-lnsensitlve string mapping was used for all 
searches except as noted above. No additional weight was 
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Table 1. Systematic Sources of false Positives and False Negatives in Unfiltered Data* 



research articles 




gene symbol/name 
Is not unique 



gene symbol is 

unrelated abbreviation 
gene symbol/name 

has language meaning 
nonstandard syntax 
unofficial gene name/symbol 
nonspeclfled gene name ) 



false positive 

false positive 

false negative 
false negative 
false negative 



• In preliminary studies. Medline was 
were amenable to global titters. Each en 
take negatives are real rebitonshipa that 
error. In general error rates maximized 



M4C-myeI*n 

associated glycoprotein 
M40-^ria] Ignancy-associated 
protein 

fA-palltd homoJogue (mouse). 

pallldln (also abbrev. for Pennsylvania) 
WA^Wbkou-AIdrlch Syndrome 

(also the word *was*) 
BAG-1 instead of BAC1 

J Instead of TPS3 
estrogen receptor Instead of 

Estrogen receptor 1 



fiber solution 



eliminate this term 



eliminate this term 

case-sensitive string search 

add dash term 

add all gene nicknames 

add family stem term 



•cnritMry, even at the expense of ipecifldty if needed 



added for multiple occurrences of a term or the cooccurrence 
of multiple synonyms for (he same gene key. 

Medline records were searched with all qualified gene 
Identifiers, such as the omclal/preferrcd gene symbol, the 
ofllclal/preferred gene name, all gene nicknames and all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer to use a 
shortened gene family name, e.g., estrogen receptor.lnstead of 
estrogen receptor 1 (ESRt). creating a source or false negatives. 
For this reason, gone famlry stem terms were created for all 
genes that have an alpha or numerical suffix (eg., IL2RA, TCFfi. 
BSRU etc) and then used to search the literature The famlry 
stem terms were handled separately from the specific gene 
names so that It would be clear when linkages were made to 
the gene family versus a specific member In (hat famlry. 

To Improve performance and accuracy, some pre-selectlon 
was applied to the records that were scanned. First, review 
articles were eliminated to avoid redundant treatment of 
citations. Second. non-English Journals were removed because 
the natural language filters were only relevant to English 
publications. Finally, Journals unlikely to contain primary data 
about gene-disease relationships were also removed (e.g., Int. 
J. health Edui. Bedside Nurse, and / Health Econ). Together, 
these filters reduced the 12 198 221 Medline publications flulv 
2003 by 37%. 1 
Ranking the Relative Strengths or Gene Disease Associa- 
tions. In total, there were 618 708 gene-disease co-cltatlons, 
In which 16% (8297) or all studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, wc tested several different statistical 
methods and examined the results. With the exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order or the gene-disease association 
strengths. However, after comparing the results to other 
databases and after consulting disease experts, the log of the 
product of frequency (LPF) was selected for further analysis 
because It gave the best results overall. 

Validation of MedGene. In developing this tool, It was 
Important to minimize the number of missed genes (raise 
negatives) and miscalled genes (false positives). However, in 
situations when these goals were In conflict, incluslveness was 
prioritized To determine the false negative rate In MedGene, 
breast cancer was used as a test case because it was associated 
with more genes than any other human disease and because 




^ \ Estimatlon of *■ ' al *> negative rate by comparison 
TJl ^TT 6 5**°**^ Th * freest cancer^leu^S 
Identified by MedGene were compared with those IteSdin 

nr^i <fV atabaW lnc,udlno Tumor Gene Database" 
(TGD) f * the Breast Cancer Gene OatabaseffiCG) ' Gen^rn! 
(GC,n and Swissprot" Genes were conslSL r^^ 

ln^M„ ro M^ ntod ,n at,ea51 of these other databases 
end not In MedGene and their link to breast cancer was sup. 
ported by at toast one literature reference. All literature references 
were verified by manual review to confirm their validity The 
number of genes In each database or shared by more than one 
database is Indicated. The false negative rate was calculated S 
genes missed at MedGene (2«/total number of nc^ove^S 
genes in other databases (285). 9 

there were several public databases that link genes to breast 
cancer We compared the list of breast cancer-related genes 
from MedGene to these databases, illustrated In Figure 1 
Among the 285 distinct breast cancer-related genes that were 
supported by at least one literature citation In these hand- 
curated databases, 26 were absent from MedGene. suggesting 
a false negative rate of approximately 9%, To determine whv 
these were missed all literature references for these genes (80 
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. papers) were reviewed manually (see the Supporting Informa- 
tion. Supplemental Table 2. or visit http://hlpseq.med. 
harvardedu/MedGene/publlcatlon/s.Table 2.htmD. Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by our specificity filters. 
Few genes were missed because they were only mentioned In 
review papers (0.4%) or they appeared only In the body of the 
manuscript but not the abstract or title (1.1%). Of note, 
MedCehe Identified approximately 2000 additional breast 
cancellated genes not listed In any other database. 

To assess the false positive error rate, two complementary 
approaches were used: a detailed analysts of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined the false positive error rate and Its sources, whereas 
* hc global approach-tested whether the overall results made 
biomedical sense. 
Using the LPP. 1467 genes related to prostate cancer were 
I assembled In rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 1 00 and the lowest 
ranked 200 genes and manually reviewed the titles and 
abstracts to determine the verity of the association. Nearly 80% 
of the highest ranked 100 genes fell into one' of the five 
categories that reflect meaningful gene^dlsease relationships 
(see the Supporting Information, Supplemental Table 3, or visit 

http://hlpseq.med.harvard.edu/MedGene/publlcatlon/ 
sJTable 3,htm0. Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
reviewed, there were only two In which the association between 
the gene and the disease was described as negative, Both wore 
genes with very low scores. In both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gone or protein was not shown to be 
related to human prostate cancer. 1 *" 

The coincidence of some gene symbols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted In most of the false positives (see the Supporting 
Information. Supplemental Table 4, or visit http://hlpse- 
q ( med.harvard.edu/MedCene/publlcatlon/s_Table 4.html), em- 
phasizing the Importance of the filters thafwere added In the 
search algorithm (Table 1). Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For example, among the papers 
about breast cancer, there were only 12 Medline records that 
referred to ESRl and 10 to ESRl whereas almost 2000 papers 
mentioned estrogen receptor without specifying ESRl or ESJ&, 
this latter group was detected by the family stem term niter' 
To further validate these results, a global analysis of the gene- 
disease relationships described by MedCene was performed 
For this experiment. It was reasoned that the more closely 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus, if the relationships denned 
by MedCene accurately reflected the literature, then an unsu- 
pervised hierarchical clustering or the gene data should group 
diseases In a manner consistent with common medical think- 
ing. Conversely, If the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or Inappropriate scoring of the data. 

To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a dataset for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were Indeed logical based upon 
common medical knowledge (see the Supporting Information. 
408 Journal of Proteome ftasearcb • Vol 2, No. 4. 2003 
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Supplemental Figure 1, or visit http://hlpseq.med.harvard.edu/ 
MedCene/publtcation/sJigure l.htmQ. For example. In one 
such cluster shown In Figure & diabetes and Its complications 
grouped together and were also closely linked to diseases 
associated with starvation states, 

The number of genes associated with a given disease can 
be estimated by adjusting the MedGene number up by the false 
negative rate (-9%) and down by the febe positive rate (-26% 
on average). Using this, the average disease has 103.7 ± 45.3 
(mean ± s.d.) genes associated with It although the range Is 
quite broad with 2359 genes related to breast cancer, 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analysis of Large Dataset Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high-throughput experiment. We compared the MedGene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis Identified 
2286 genes that had greater than a 1-fold difference In mean 
expression level between breast cancer samples and normal 
breast samples. Using MedGene, we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer In the 
literature by gene term search (first-degree association by gene 
name); 328 genes directly linked by family term search (Srst- 
degree association by family terra); 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association): and 505 genes not previously associated with 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2. or visit http://hipseq.nted.harvard.edu/MedGene/ 
publlcatlon/sjlgure 2.html) Among the 505 previously un- 
related genes. 467 were either newly Identified genes or genes 
that had not previously been associated with any disease 
Among the remaining 38 genes. 0 had been related to other" 
cancers, specifically esophageal colon, uterine, skin, and cervix. 
• To determine whether the genes highlighted by the micro- 
array analysis were more likely to have been previously linked 
to breast cancer In the literature, we created a two-dimensional 
plot of the fold change of expression level between breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A). There was a broad spread of expression changes 
among the genes directly linked to breast cancer ranging from 
ess than i-fold change (68%) to over 40-foW (0.3%). Notably 
the majority or genes with greater than 10-fold expression 
changes were linked to breast cancer by flrst-degreeassocla- 
tlon. 

Among all 754 genes directly linked to breast cancer In the 
literature, there was no correlation between LPP and micro- 

ll^H 86 (re ° m ^ value - °- 62 >- However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend In correlation (Figure 
3B) suggesting that genes with a more substantial change In 

In the literature For genes that had 10-fold change or more In 
expression level, the correlation Increased Id 0.41 (p-value » 

When we evaluated the rnfcro-array data separately for ER 
positive and ER negative tumors, the trend Tn correlation 
between fold change and literature score was highly dependent 
on estrogen receptor statu*. Interestingly, then? was a similar 

correlation for ER negative tumors. 
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tfa*f 



B 



Coxaeoklevirue Infection* 
Obeelty la Diabetee 
Diabetic fcetoacldoela 
Glueoae Intolerance 

Diabetee Mellltua, Ron- Ineulin- Dependent 

Diabetee Mellltua, Ineulin-Dependent 

Pregnancy in Diabetica 

Diabetic Retinopathy 

Diabetic Anglopatnlee 

Diabetic Houropatnlee 

Glycosuria 

Hyperineullnlaa 

Hyperlneullnemla 

Hypoglycemia 

Hyparglyeeaia 

Diabetee Mellltua, Experimental 

Diabetee Mellltua 

01 aba tea, Qaatatlonal 

^T^aYroffon""*"*^""*"^"~*"*~--^--^*^' 
Jaundice, Neonatal 
Brain Bdeaa 
Pulaonary Bdeaa 
Nutrition Dlaordera 
KwaealorXor 
Critical Zllnaaa 
Burn* 

Diabetic Nephropathies 

Albuminuria 

Insulinoma 




them. A sample of the data 15 shown here. 2(B), One of the rMuirinn m ..tape i* JZ ^ p associated with 

terms (abovetfte line) sm> starvation states £ ^ * ^ D,abeU * 

diabetic small vessel complications, altered seaWcher^ T£ 9T °Tr\ * *** C,U5terlr * of 

harvard.edu/NtedGer^^ chemistries, nutritional disorders. etc.(SuDplementel Figure 1; http^/hlpseq.med. 



Finally, to validate our findings, we computed similar cor- 
relations between the breast cancer expression data and 
LPF scores generated by MedGene for hypertension, a 



disease unrelated to breast cancer. As expected, we did not 
observe an Increasing trend In correlation for hyperten- 
sion. 
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Microarray Fold Change (Cancer/Normal) 



B 



0.8 



0.7 



0.1 



Microarray Fold Change 

Figure 3. Relationship between literature score and functional data for hr»a«i ranw un,.-. i 

samples for breast tumor* and normal breast tissue wer TanaNwS TJZSZZ *? m eXpress4on ■"■*■»• <* 
tumor and normal sample (cutoff a 3-folfl I chenori T^TftE ? ,0W d,fference oppression level between breast 
Green dots represent flrst-degree l^^T^T^K^l^Z^ Tt ^ ' COre for » TO *»» »«• 

dots represent no.a««l,tiom Some wel.-studted sCch S^T^l^t^T^ * ^ 
expression level. Furthermore/the majority of genej that £ n^.tth K '' "? ^ " y a «*•*"•* difference In 

expression changes (shaded area). 38. The^pe^a fSK^3SS2^ £T "^f " terature h «» •*» «»" 10-told 
of expression level between tumor and ^^s^r^^X^TJ?^ " tor8h "! KCn (LPF) 8nd »• W change 
(*axls). Gene rank lists were generated for i^MS^ffl^SiSSS SI 0 ' f ° W Ch8n9e ° f "P™'*'"-"! 
the breast cancer gene LPF scores and fold change eweSbn oW ^ * W " C ° rrelaUons ^computed between 

estrogen receptor negative tumors only (purple) T Mpre$S,0n data amon fl "« ro 9 en "^P 1 " P<«'«ve tumors only (light blue) and 
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Table t. Top 25 Genes Related to Selected Human Diseases* 



research articles 



breast neoplasms 



estrogen receptor 
PGR 

ERBB2 a 
BRCAl 
BRCA2 
EGPR 

cms 

TFFi 

PSEN2 

TPS3 

CESS 
CEACAM5 



ERBB3 _ 

cydin 

COXSA 

cathepsin 

ERBB4 

CCNDi 

EOF 

MUCl 



InsuIln-llke 
BCL2 

mucin . 
FCfS 



hypertension 
REN " 



rheumatoid anhrttU 



bipolar disorder 



LEP 
ACT 
INS 

kalllkrein 
ACE 

endothelln 

S10CA6 

BDK 

] DIANPH 
SARi 

-PBl- 



CD5$ 
ALB 

CYPUB2 
MATZB 
angiotensin 
receptor 

AGTR3 

NPPA 

LVM 

DBH 
NPY 

POKfC 
neuropeptide 



RA 

TNFRSF1QA 
CRP 
AS 
ESR! 

HLA-DRBi 
DRI 

knterieukln 

77VF 

R6 

collagen 
ILIA 

ACR 

TNFRSFJ2 
JL2 

CHISU 
JL8 

knterieukln I 
matrix 

metalloprotelnase 

Interferon 

CDS8 

JL4 
HI? 

MMP3 



ERDAI 

SNAP29 

PFKL 

DRD2 

TRH 

JMPA2 

Him 

DRD3 

REM 

KCNN3 

DRDi 
HTR2C 

RWT~ 

DBH 

MAOA 

com 
mm 

SYHJI 

INPPl 
NEDD4L 
FRA13C 
transducer of 



BAIAP3 

ATPIB3 
DRDS 



atherosclerosis 



apollpoproteln 

APOE 

LDLR 

ELN 

ARC! 

APOB 

APOAi 

MSRI 

IPL 

PONl 

plasminogen 
activator Inhibitor 
PIG 

vascular cell 

adhesion molecule 

ATOHi 

VWF 

INS 

ARG2 

ABCAI 

OLR1 

collagen 

MCP 

lipoprotein 
APOA2 
Intercellular 
adhesion molecule 
RAB27A 



Discussion 

The Human Genome Project heralded a new era in biological 
•research where the emphasis on understanding specific path- 
ways has expanded to global studies of genomic organ teatlon 
and biological systems. High-throughput technologies can 
provide novel Insight into comprehensive biological function 
but also Introduces new challenges. The utility of these 
technologies Is limited to the ability to generate, analyze, and 
Interpret large gene lists. MedGene, a relational database 
derived by mining the Information in Medline, was created to 
address this need, MedGene users can query for a rank-ordered 
list of human gene*dlsease relationships (Table 2) for one or 
more diseases. Each entry Is hyperllnked to the original papers 
supporting each association and to other relevant databases. 

MedCene is an innovative extension of previous text mining 
approaches. Perez-Iratxeta et al. used the CO annotation and 
their chromosomal locations to predict genes that may con- 
tribute to Inherited disorders. 1 MedGene takes a broader view 
and Includes all diseases and all possible gene-disease relation- 
ships. Furthermore. MedCene utilizes eo-cltarlon to Indicate a 
relationship ratlter than CO annotation, which Is limited to the 
subset of genes that have CO annotation. Our approach Is 
complementary to that taken by Chaussabel and Sher. who 
used the frequency of co-cltcd terms to cluster genes Into a 
hierarchy of gene-gene relationships.* 

A unique aspect of this tool Is the ability to assess the relative 
strengths of gene-disease relationships based on the frequency 
of both co-citatlon and single citation. This presupposes that 
most co-cltatlons describe a positive association, often referred 
to as publication bias" and is supported by our observations 



that negative associations are rare (Supplemental Table 3: 

http://hlpseq.med.harvard.edU/MedGene/publlcatlon/s Ta- 
ble 3.htrnO. Of course, relationships established by frequency 
of co-citation do not necessarily represent a true biological link; 
however. It is strong evidence to support a true relationship 
Another Important feature of MedCene Is the Implementa- 
tion of software filters that substantially reduced the error rate 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest associations were real 
For this study, all of the filters that we applied were general 
ones, e.g.. expanding the list of all gene names to address the 
different syntax forms used by different journals, eliminating 
gene names that correspond to common English words etc 
The majority of the remaining search term ambiguities were 
Idiosyncratic and difficult to identify systematically without 
causing a significant rise in false negatives. Alternative ap- 
proaches, such as the examination of the nearest neighbor 
terms, need to be considered to further reduce the false positive 

It Is not uncommon to see expression changes in micro- 
array experiments as small as 2-fold reported In the literature. 
Even when these expression changes are statistically significant, 
It is not always clear Ifthey are biologically meaningful. When 
comparing expression levels of disease to normal tissue one 
expects an enrichment or known disease-related genes to 
appear In the altered expression group. MedCene provided a 
unique opportunity to test this notion In the context of existing 
knowledge on a novel breast cancer micro-array dataset. For 
genes displaying a 5-fold change or less In tumors compared 
to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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Tabla 3. Genes with Large Expression Changes In ER- but 
Not In ER+ Breast Tumors 



gene symbol 


fold chang* (BR+) 


- fold chuue fER— \ 


KRTHBl 


1.D 


610.8 


BRS3 


1.2 


89.4 


DKKl 




69,8 


ZiCl 


1.9 


59.6 


TLRl 


1.0 


38.5 


KIAAQ6S0 


2.6 


33.2 


CDKNS 


1.0 


30.6 


EB12 


4.0 


27.9 


GZMB 


3.8 


21.9 


STK18 


4.7 


18.6 


GPR49 


1.0 


14.6 


MYO10 


1.6 


14.4 


1ADI 


-1.0 


13.5 


POLE2 


42 


13.0 


HMG4 


4.4 


12.9 


BCL2LU 


-1.2 


12.3 


LRPS 


2.9 


12.2 


CCNB2 


1.0 


11. 8 


CCNE2 


4.0 


11.6 


FOB 


-4.3 


11.1 


KNSL6 


2.9 


10.9 


H1F5 


3.0 


10 2 


SERPJNH2 


4.6 


10.2 


YAP1 


1.0 


10.0 


LPHB 


-1.3 


-10.4 


TCBA2 


-1.1 


-10.8 


TFF1 


1.3 


-11.4 


COU7A1 


-4.1 


-15.7 


POPS 


1.1 


-16.2 


BPACi 


-4.6 


-22.3 


PDZK1 


-M 


-36.8 


VECFC 


-2.8 


-51.5 


MUC6 


-1.4 


-64.9 


SERPINA5 


-1.0 


-83*1 


MEJSI 


-1.6. 


-85.9 


CA12 


2.4 


-150.3 



Tab la 3. MedCene identified a set of relatively urteratudtod, yet highly 
expressed genes in ER negative, but not ER positive breast tumors. All of 
these genes have either never been co-cited wlih breast cancer or have « 
weak association except those marked with an \ 



reflects the many genes whose role In breast cancer may not 
Involve large changes In expression In sporadic tumors (e,g.. 
URCA1 and BFCA2) and genes whose modest changes In 
expression may be unrelated to the disease. Strikingly, among 
genes with a 10-fold change or more In expression level, there 
was a strong and significant correlation between expression 
level and a published role In the disease, providing the first 
global validation of the micro* array approach to Identifying 
disease-specific genes. 

The results derived from MedCene have two Implications. 
First, a careful hunt for corroborating evidence of a role In 
breast cancer should precede any further study of genes with 
less than 5-fold expression level changes. Second, any genes 
with 10* fold changes or more arc likely to be related to breast 
cancer and warrant attention. It Is likely that this threshold will 
change depending on the disease as well as the experiment. 

Interestingly, the observed correlation was onry found among 
ER-posltlve tumors, not ER-negatlve. This may reflect a bias 
In the literal ure lo study the more prevalent type of tumor In 
the population. Furthermore, this emphasizes that caution 
must be taken when Interpreting experiments that may contain 
subpopulatlons that behave very different |y, The MedCene 
approach Identified a set of relatively understudied, yet highly 
expressed genes In ER-negatlve tumors that are worthy of 
further examination (Table 3). 



HuetaL 

In conclusion, we have developed an automated method of 
summarizing and organizing the vast biomedical literature. To 
our knowledge, the resulting database Is the most comprehen- 
sive and accurate of Its kind. By generating a score that reflects 
the strength of the association, it provides an Important too] 
for the rapid and flexible analysis of large datasets from various 
high-throughput screening experiments. Furthermore, it can 
be used for selecting subsets of genes for functional studies, 
for building disease* specific arrays, for looking at genes com- 
mon to multiple diseases and various other high-throughput 
applications. In the future. It will be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other biological 
processes and concepts, such as cell division and responses to 
small molecules. 
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Aneuploidy and cancer 

Subrata Sen, PhD 



Numeric aberrations in chromosomes, referred to as aneu- 
ploidy t is commonly observed in human cancer. Whether aneu- 
ploidy is a cause or consequence of cancer has long been 
debated. Three lines of evidence* now make a compelling case 
for aneuploidy being a discrete chromosome mutation event 
that contributes to malignant transformation and progression 
process. First, precise assay of chromosome aneuploidy in 
several primary tumors with in situ hybridization and compara- 
tive genomic hybridization techniques have revealed that 
specific chromosome aneusomies correlate with distinct tumor 
phenotypes. Second, aneuploid tumor cell lines and in vitro 
transformed rodent cells have been reported to display an 
elevated rate of chromosome instability, thereby indicating that 
aneuploidy is a dynamic chromosome mutation event associ- 
ated with transformation of cells. Third, and most important, a 
number of mitotic genes regulating chromosome segregation 
have been found mutated in human cancer cells, implicating 
such mutations in induction of aneuploidy in tumors. Some of 
these gene mutations, possibly allowing unequal segregations 
of chromosomes, also cause tumorigenic transformation of 
cells in vitro. In this review, the recent publications investigat- 
ing aneuploidy in human cancers, rate of chromosome instabil- 
ity in aneuploidy tumor cells, and genes implicated in regulat- 
ing chromosome segregation found mutated in cancer cells 

are discussed. Curr Opin Oncol 2000, 12:82-88 © 2000 Lippincott Williams 
& Wilkins, !nc 



Cancer research over the past decade has firmly estab- 
lished that malignant cells accumulate a large number of 
genetic mutations that affect differentiation, prolifera- 
tion, and cell death processes. In addition, it is also 
recognized that most cancers are clonal, although they 
display extensive heterogeneity with respect to kary- 
otypes and phenotypes of individual clonal populations. 
It is estimated that numeric chromosomal imbalance, 
referred to as aneuploidy^ is the most prevalent genetic 
change recorded among over 20,000 solid tumors 
analyzed thus far [1]. Phenotypic diversity of the clonal 
populations in individual tumors involve differences in 
morphology, proliferative properties, antigen expression, 
drug sensitivity, and metastatic potentials. It has been 
proposed that an underlying acquired genetic instability 
is responsible for the multiple mutations detected in 
cancer cells that lead to tumor heterogeneity and 
progression [2]. In a somewhat contradictory argument, 
it has also been suggested that clonal expansion due to 
selection of cells undergoing normal rates of mutation 
can explain malignant transformation and progression 
process in humans [3]. Acquired genetic instability, 
nonetheless, is considered important for more rapid 
progression of the disease [4*«]. Although the original 
hypothesis on genetic instability in cancer primarily 
focused on chromosome imbalances in the form of aneu- 
ploidy in tumor cells, the actual relevance of such muta- 
tions in cancer remains a controversial issue. 
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Whether or not aneuploidy contributes to the malignant 
transformation and progression process has long been 
debated. A prevalent idea on genetics of cancer referred 
to as "somatic gene mutation hypothesis" contends that 
gene mutations at the nucleotide level alone can cause 
cancer by either activating cellular proto-oncogenes to 
dominant cancer causing oncogenes and/or by inactivat- 
ing growth inhibitory tumor suppressor genes. In this 
scheme of things chromosomal instability in the form of 
aneuploidy is a mere consequence rather than a cause of 
malignant transformation and progression process. 

In this review, some of the recent observations on the 
subject are discussed and compelling evidence is 
provided to suggest that aneuploidy is a distinct form of 
genetic instability in cancer that frequently correlates 
with specific phenotypes and stages of the disease. 
Furthermore, discrete genetic targets affecting chromo- 
somal stability in cancer cells, recently identified, are 
also discussed. These data provide a new direction 
toward elucidating the molecular mechanisms responsi- 
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ble for induction of aneuploidy in cancer and may even- 
tually be exploited as novel therapeutic targets in the 
future. 

Genetic alterations in cancer 

Alterations in many genetic loci regulating growth, 
senescence, and apoptosis, identified in tumor cells, 
have led to the current understanding of cancer as a 
genetic disease. The genetic changes identified in 
tumors include: subtle mutations in genes at the 
nucleotide level; chromosomal translocations leading to 
structural rearrangements in genes; and numeric 
changes in either partial segments of chromosomes or 
whole chromosomes (aneuploidy) causing imbalance in 
gene dosage. 

For the purpose of this review, both segmental and whole 
chromosome imbalances leading to altered DNA dosage 
in cancer cells are included as examples of aneuploidy. 

Incidence of aneuploidy in cancer 

Evidence of aneuploidy involving one or more chromo- 
somes have been commonly reported in human tumors. 
Although these observations were initially made using 
classic cytogenetic techniques late in a tumor's evolu- 
tion and were difficult to correlate with cancer progres- 
sion, more recent studies have reported association of 
specific nonrandom chromosome aneuploidy with 
different biologic properties such as loss of hormone 
dependence and metastatic potential [5]. 

Classic cytogenetic studies performed on tumor cells 
had serious limitations in scope because they were 
applicable only to those cases in which mitotic chromo- 
somes could be obtained. Because of low spontaneous 
rates of cell division in primary tumors, analyses 
depended on cells either derived selectively from 
advanced metastases or those grown in vitro for variable 
periods of time. In both instances, metaphases analyzed 
represented only a subset of primary tumor cell popula- 
tion. Two major advances in cytogenetic analytic tech- 
niques, in situ hybridization (ISH) and comparative 
genomic hybridization (CGH), have allowed better reso- 
lution of chromosomal aberrations in freshly isolated 
tumor cells [6]. ISH analyses with chromosome-specific 
DNA probes, a powerful adjunct to metaphasic analysis, 
allows assessment of chromosomal anomalies within 
tumor cell populations in the contexts of whole nuclear 
architecture and tissue organization. CGH allows 
genome wide screening of chromosomal anomalies 
without the use of specific probes even in the absence 
of prior knowledge of chromosomes involved. Although 
both techniques have certain limitations in terms of 
their resolution power, they nonetheless provide a 
better approximation of chromosomal changes occurring 
among tumors of various histology, grade, and stage 



compared with what was possible with the classic cyto- 
genetic techniques. Genomic ploidy measurements 
have also been performed at the DNA level with flow 
cytometry and cytofluorometric methods. Although 
these assays underestimate chromosome ploidy due to a 
chromosomal gain occasionally masking a chromosomal 
loss in the same cell, several studies using these 
methods have supported the conclusion that DNA 
aneuploidy closely associates with poor prognosis in 
various cancers [7,8]. This discussion of some recent 
examples published on aneuploidy in cancer includes 
discussion of studies dealing with DNA ploidy measure- 
ments as well. Most of these observations are correlative 
without direct proof of specific involvement of genes on 
the respective chromosomes. Identification of putative 
oncogenes and tumor suppressor genes on gained and 
lost chromosomes in aneuploid tumors, however, are 
providing strong evidence that chromosomes involved in 
aneuploidy play a critical role in the tumorigenic 
process. 

In renal tumors, either segmental or whole chromosome 
aneuploidy appears to be uniquely associated with 
specific histologic subtypes [9]. Tumors from patients 
with hereditary papillary renal carcinomas (HPRC) 
commonly show trisomy of chromosome 7, when 
analyzed by CGH. Germline mutations of a putative 
oncogene MET have been detected in patients with 
HPRC. A recent study [10] has demonstrated that an 
extra copy of chromosome 7 results in nonrandom dupli- 
cation of the mutant MET allele in HPRC, thereby 
implicating this trisomy in tumorigenesis. The study 
suggested that mutation of MET may render the cells 
more susceptible to errors in chromosome replication, 
and that clonal expansion of cells harboring duplicated 
chromosome 7 reflects their proliferative advantage. In 
addition to chromosome 7, trisomy of chromosome 17 in 
papillary tumors and also of chromosome 8 in mesoblas- 
tic nephroma are commonly seen. Association of specific 
chromosome imbalances with benign and malignant 
forms of papillary renal tumors, therefore, not only 
contribute to an understanding of tumor origins and 
evolution, but also implicate aneuploidy of the respec- 
tive chromosomes in the tumorigenic transformation 
process. 

In colorectal tumors, chromosome aneuploidy is a 
common occurrence. In fact, molecular allelotyping 
studies have suggested that limited karyotyping data 
available from these tumors actually underestimate the 
true extent of these changes. Losses of heterozygosity 
reflecting loss of the maternal or paternal allele in 
tumors are widespread and often accompanied by a gain 
of the opposite allele. Therefore, for example, a tumor 
could lose a maternal chromosome while duplicating 
the same paternal chromosome, leaving the tumor cell 
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with a normal karyotype and ploidy but an aberrant 
allelotype. It has been estimated that cancer of the 
colon, breast, pancreas, or prostate may lose an average 
of 25% of its alleles. It is not unusual to discover that a 
tumor has lost over half of its alleles [4]. In clinical 
settings, DNA ploidy measurements have revealed that 
DNA aneuploidy indicates high risk of developing 
severe premalignant changes in patients with ulcerative 
colitis, who are known to have an increased risk of 
developing colorectal cancer [11]. DNA aneuploidy has 
been found to be one of the useful indicators of lymph 
node metastasis in patients with gastric carcinoma and 
associated with poor outcome compared with diploid 
cases [12,13]. CGH analyses of chromosome aneu- 
ploidy, on the other hand, was reported to correlate gain 
of chromosome 20q with high tumor S phase fractions 
and loss of 4q with low tumor apoptotic indices [14]. 
Aneuploidy of chromosome 4 in metastatic colorectal 
cancer has recently been confirmed in studies that used 
unbiased DNA fingerprinting with arbitrarily primed 
polymerase chain reactions to detect moderate gains 
and losses of specific chromosomal DNA sequences 
[15]. The molecular karyotype (amplotype) generated 
from colorectal cancer revealed that moderate gains of 
sequences from chromosomes 8 and 13 occurred in 
most tumors, suggesting that overrepresentation of 
these chromosomal regions is a critical step for metasta- 
tic colorectal cancer. 

In addition to being implicated in tumorigenesis and 
correlated with distinct tumor phenotypes, chromosome 
aneuploidy has been used as a marker of risk assessment 
and prognosis in several other cancers. The potential 
value of aneuploidy as a noninvasive tool to identify 
individuals at high risk of developing head and neck 
cancer appears especially promising. Interphase fluores- 
cence in situ hybridization (FISH) revealed extensive 
aneuploidy in tumors from patients with head and neck 
squamous cell carcinomas (HNSCC) and also in clini- 
cally normal distant oral regions from the same individu- 
als [16,17]. It has been proposed that a panel of chromo- 
some probes in FISH analyses may serve as an 
important tool to detect subclinical tumorigenesis and 
for diagnosis of residual disease. The presence of aneu- 
ploid or tetraploid populations is seen in 90% to 95% of 
esophageal adenocarcinomas, and when seen in 
conjunction with Barrett's esophagus, a premalignant 
condition, predicts progression of disease [18,19]. 
Chromosome ploidy analyses in conjunction with loss of 
heterozygosity and gene mutation studies in Barrett's 
esophagus reflect evolution of neoplastic cell lineages in 
vivo [20]. Evolution of neoplastic progeny from Barrett's 
esophagus following somatic genetic mutations 
frequently involves bifurcations and loss of heterozygos- 
ity at several chromosomal loci leading to aneuploidy 
and cancer. Accordingly, it is hypothesized that during 



tumor cell evolution diploid cell progenitors with 
somatic genetic abnormalities undergo expansion with 
acquired genetic instability. Such instability, often 
manifested in the form of increased incidence of aneu- 
ploidy, enters a phase of clonal evolution beginning in 
premalignant cells that proceeds over a period of time 
and occasionally leads to malignant transformation. The 
clonal evolution continues even after the emergence of 
cancer. 

The significance of DNA and chromosome aneu- 
ploidy in other human cancers continue to be evalu- 
ated. Among papillary thyroid carcinomas, aneuploid 
DNA content in tumor cells was reported to correlate 
with distant metastases, reflecting worsened progno- 
sis [21]. Genome wide screening of follicular thyroid 
tumors by CGH, on the other hand, revealed frequent 
loss of chromosome 22 in widely invasive follicular 
carcinomas [22]. Chromosome copy number gains in 
invasive neoplasm compared with foci of ductal carci- 
noma in situ (DCIS) with similar histology have been 
proposed to indicate involvement of aneuploidy in 
progression of human breast cancer [23]. ISH analyses 
of cervical intraepithelial neoplasia has provided 
suggestive evidence that chromosomes 1, 7 and X 
aneusomy is associated with progression toward cervi- 
cal carcinoma [24]. 

Although the prognostic value of numeric aberrations 
remains a matter of debate in human hematopoietic 
neoplasia, there have been recent studies to suggest that 
the presence of monosomy 7 defines a distinct subgroup 
of acute myeloid leukemia patients [25]. It is interesting 
in this context that therapy-related myelodysplastic 
syndromes have been reported to display monosomy 5 
and 7 karyotypes, reflecting poor prognosis [26]. 

The clinical observations, mentioned previously, are 
supported by in vitro studies in human and rodent cells in 
which aneuploidy is induced at early stages of transforma- 
tion [27,28]. It is even suggested that aneuploidy may 
cause cell immortalization, in some instances, that is a 
critical step preceeding transformation. 

Finally, in an interesting study to develop transgenic 
mouse models of human chromosomal diseases, chromo- 
some segment specific duplication and deletions of the 
genome were reported to be constructed in mouse 
embryonic stem cells [29]. Three duplications for a 
portion of mouse chromosome 11 syntenic with human 
chromosome 17 were established in the mouse 
germline. Mice with 1Mb duplication developed corneal 
hyperplasia and thymic tumors. The findings represent 
the first transgenic mouse model of aneuploidy of a 
defined chromosome segment that documents the direct 
role of chromosome aneusomy in tumorigenesis. 
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Aneuploidy as "dynamic cancer-causing 
mutation" instead of a "consequential state" 
in cancer 

According to the hypothesis previously discussed, aneu- 
ploidy represents either a "gain of function" or "loss of 
function" mutation at the chromosome level with a 
causative influence on the tumorigenesis process. The 
hypothesis, however, is based only on circumstantial 
evidence even though existence of aneuploidy is corre- 
lated with different tumor phenotypes. The existence of 
numeric chromosomal alterations in a tumor does not 
mean that the change arose as a dynamic mutation due 
to genomic instability, because several factors could lead 
to consequential aneuploidy in tumors, also. Although 
aneuploidy as a dynamic mutation due to genomic insta- 
bility in tumor cells would occur at a certain measurable 
rate per cell generation, a consequential state of aneu- 
ploidy in tumors may not occur at a predictable rate 
under similar conditions or in tumors with similar 
phenotypes. In addition to genomic instability, differ- 
ences in environmental factors with selective pressure, 
could explain high incidence of aneuploidy and other 
somatic mutations in tumors compared with normal cells 
[4]. These include humoral, cell substratum, and cell- 
cell interaction differences between tumor and normal 
cell environments. It could be argued that despite 
similar rates of spontaneous aneuploidy induction in 
normal and tumor cells, the latter are selected to prolif- 
erate due to altered selective pressure in the tumor cell 
environment, whereas the normal cells are eliminated 
through activation of apoptosis. Alternatively, of course, 
one could postulate that selective expression or overex- 
pression of anti-apoptotic proteins or inactivation of 
proapoptotic proteins in tumor cells may counteract 
default induction of apoptosis in G2/M phase cells 
undergoing missegregation of chromosomes. Recent 
demonstration of overexpression of a G2/M phase anti- 
apoptotic protein survivin in cancer cells [30] suggests 
that this protein may favor aberrant progression of aneu- 
ploid transformed cells through mitosis. This would 
then lead to proliferation of aneuploid cell lineages, 
which may undergo clonal evolution. 

To ascertain that aneuploidy is a dynamic mutational 
event, various human tumor cell lines and transformed 
rodent cell lines have been analyzed for the rate of 
aneuploidy induction. When grown under controlled in 
vitro conditions, such conditions ensure that environ- 
mental factors do not influence selective proliferation of 
cells with chromosome instability. In one study, 
Lengauer et aL [31»] provided unequivocal evidence by 
FISH analyses that losses or gains of multiple chromo- 
somes occurred in excess of 10" 2 per chromosome per 
generation in aneuploid colorectal cancer cell lines. The 
study further concluded that such chromosomal instabil- 
ity appeared to be a dominant trait. Using another in 



vitro model system of Chinese hamster embryo (CHE) 
cells, Duesberg et aL [32 # ] have also obtained similar 
results. With clonal cultures of CHE cells, transformed 
with nongenotoxic chemicals and a mitotic inhibitor, 
these authors demonstrated that the overwhelming 
majority of the transformed colonies contained more 
than 50% aneuploid cells, indicating that aneuploidy 
would have originated from the same cells that under- 
went transformation. All the transformed colonies tested 
were tumorigenic. It was further documented that the 
ploidy factor representing the quotient of the modal 
chromosome number divided by the normal diploid 
number, in each clone, correlated directly with the 
degree of chromosomal instability. Therefore, chromo- 
somal instability was found proportional to the degree of 
aneuploidy in the transformed cells and the authors 
hypothesized that aneuploidy is a unique mechanism of 
simultaneously altering and destabilizing, in a massive 
manner, the normal cellular phenotypes. In the absence 
of any evidence that the transforming chemicals used in 
the study did not induce other somatic mutations, it is 
difficult to rule out the contribution of such mutations 
in the transformation process. These results nonetheless 
make a strong case for aneuploidy being a dynamic chro- 
mosome mutation event intimately associated with 
cancer. 

Aneuploidy versus somatic gene mutation in 
cancer 

The idea that numeric chromosome imbalance or aneu- 
ploidy is a direct cause of cancer was proposed at the 
turn of the century by Theodore Boveri [33]. However, 
the hypothesis was largely ignored over the last several 
decades in favor of the somatic gene mutation hypothe- 
sis, mentioned earlier. Evidence accumulating in the 
literature lately on specific chromosome aneusomies 
recognized in primary tumors, incidence of aneuploidy 
in cells undergoing transformation, and aneuploid tumor 
cells showing a high rate of chromosome instability have 
led to the rejuvenation of Boveri's hypothesis. The 
concept has recently been discussed as a "vintage wine 
in a new bottle" [34*]. The author points out that 
except for rare cancers caused by dominant retroviral 
oncogenes, diploidy does not seem to occur in solid 
tumors, whereas aneuploidy is a rule rather than excep- 
tion in cancer. 

Aneuploidy as an effective mutagenic mechanism 
driving tumor progression, on the other hand, is being 
recognized as a viable solution to the paradox that with 
known mutation rate in non-germline cells (~10~ 7 per 
gene per cell generation) tumor cell lineages cannot 
accumulate enough mutant genes during a human life- 
time [35]. The concept is gaining significant credibility 
since genes that potentially affect chromosome segrega- 
tion were found mutated in human cancer. Some of 
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these genes have also been shown to have transforming 
capability in in vitro assays. Selected recent publications 
describing the findings are being discussed below in 
reference to the mitotic targets potentially involved in 
inducing chromosome segregation anomalies in cells. 

Potential mitotic targets and molecular 
mechanisms of aneuploidy 

Because aneuploidy represents numeric imbalance in 
chromosomes, it is reasonable to expect that aneuploidy 
arises due to missegregation of chromosomes during cell 
division. There are many potential mitotic targets, 
which could cause unequal segregation of chromosomes 
(Fig. 1). Recent investigations have identified several 
genes involved in regulating these mitotic targets and 
mitotic checkpoint functions, which can be implicated 
in induction of aneuploidy in tumor cells. This discus- 
sion is restricted to those mitotic targets and checkpoint 
genes whose abnormal functioning has been observed in 
cancer or has been shown to cause tumorigenic transfor- 
mation of cells, in recent years. The role of telomeres is 
discussed elsewhere in this issue. For a more detailed 
description of the components of mitotic machinery and 
their possible involvement in causing chromosome 
segregation abnormalities in tumor cells, readers may 
refer to a recently published review [36 # ]. 

Among the mitotic targets implicated in cancer, centro- 
some defects have been observed in a wide variety of 
malignant human tumors. Centrosomes play a central role 
in organizingthe microtubule network in interphase cells 
and mitotic spindle during cell division. Multipolar 
mitotic spindles have been observed in human cancers in 
situ and abnormalities in the form of supernumerary 

Figure 1. Potential mitotic targets causing aneuploidy in 
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Diagram illustrates that defects in several processes involving chromosomal, 
spindle microtubule, and centrosomal targets, in addition to abnormal cytokine- 
sis, may cause unequal partitioning of chromosomes during mitosis, leading to 
aneuploidy, Recently obtained evidence in favor of some of these possibilities is 
discussed in the text. 



centrosomes, centrosomes of aberrant size and shape as 
well as aberrant phosphorylation of centrosome proteins 
have been reported in prostate, colon, brain, and breast 
tumors [37,38]. In view of the findings that abnormal 
centrosomes retain the ability to nucleate microtubules in 
vitro, it is conceivable that cells with abnormal centro- 
somes may missegregate chromosomes producing aneu- 
ploid cells. The molecular and genetic bases of abnormal 
centrosome generation and the precise pathway through 
which they regulate the chromosome segregation process 
remain to be elucidated. Recent discovery of a centro- 
some-associated kinase STK15/BTAK/aurora2, naturally 
amplified and overexpressed in human cancers, has raised 
the interesting possibility that aberrant expression of this 
kinase is critically involved in abnormal centrosome func- 
tion and unequal chromosome segregation in tumor cells 
[39,40]. Exogenous expression of the kinase in rodent and 
human cells was found to correlate with an abnormal 
number of centrosomes, unequal partitioning of chromo- 
somes during division, and tumorigenic transformation of 
cells. It is relevant in this context to mention that the 
Xenopus homologue of human STK15/BTAK/aurora2 
kinase has recently been shown to phosphorylate a micro- 
tubule motor protein XlEgS, the human orthologue of 
which is known to participate in the centrosome separa- 
tion during mitosis [41]. Findings on STK15/aurora2 
kinase, thus, provide an interesting lead to a possible 
molecular mechanism of centrosome's role in oncogene- 
sis. Centrosomes have, of late, been implicated in onco- 
genesis from studies revealing supernumerary centro- 
somes in />53-deficient fibroblasts and overexpression of 
another centrosome kinase PLK1 being detected in 
human non-small cell lung cancer [42]. 

One of the critical events that ensures equal partition- 
ing of the chromosomes during mitosis is the proper 
and timely separation of sister chromatids that are 
attached to each other and to the mitotic spindle. 
Untimely separation of sister chromatids has been 
suspected as a cause of aneuploidy in human tumors. 
Cohesion between sister chromatids is established 
during replication of chromosomes and is retained until 
the next metaphase/anaphase transition. It has been 
shown that during metaphase-anaphase transition, the 
anaphase promoting complex/cyclosome triggers the 
degradation of a group of proteins called securins that 
inhibit sister chromatid separation. A vertebrate securin 
(v-securin) has recently been identified that inhibits 
sister chromatid separation and is involved in transfor- 
mation and tumorigenesis. Subsequent analysis 
revealed that the human securin is identical to the 
product of the gene called pituitary tumor transforming 
gene, which is overexpressed in some tumors and 
exhibits transforming activity in NIH3T3 cells. It is 
proposed that elevated expression of the v-securin may 
contribute to generation of malignant tumors due to 
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chromosome gain or loss produced by errors in chro- 
matid separation [43*]. 

Normal progression through mitosis during prophase to 
anaphase transition is monitored at least at two check- 
points. One checkpoint operates during early prophase 
at G2 to metaphase progression while the second 
ensures proper segregation of chromosomes during 
metaphase to anaphase transition. Several mitotic 
checkpoint genes responding to mitotic spindle defects 
have been identified in yeast. The metaphase-anaphase 
transition is delayed following activation of this check- 
point during which kinetochores remain unattached to 
the spindle. The signal is transmitted through a kineto- 
chore protein complex consisting of Mpslp and several 
Mad and Bub proteins [44]. It is expected that for 
unequal chromosome segregation to be perpetuated 
through cell proliferation cycles giving rise to aneu- 
ploidy, checkpoint controls have to be abrogated. 

Following this logic, Vogelstein eta/. [45»] hypothesized 
that aneuploid tumors would reveal mutation in mitotic 
spindle checkpoint genes. Subsequent studies by these 
investigators have proven the validity of this hypothesis 
and a small fraction of human colorectal cancers have 
revealed the presence of mutations in either hBubl or 
hBubRl checkpoint genes. It was further revealed that 
mutant BUB1 could function in a dominant negative 
manner conferring an abnormal spindle checkpoint 
when expressed exogenously. Inactivation of spindle 
checkpoint function in virally induced leukemia has also 
recently been documented following the finding that 
hMADl checkpoint protein is targeted by the Tax 
protein of the human T-cell leukemia virus type 1. 
Abrogation of hMADl function leads to multinucleation 
and aneuploidy [46]. 

In addition to mitotic spindle checkpoint defects, failed 
DNA damage checkpoint function in yeast is frequently 
associated with aberrant chromosome segregation as 
well. It, therefore, appears intriguing yet relevant that 
the human BRCA1 gene, proposed to be involved in 
DNA damage checkpoint function, when mutated by a 
targeted deletion of exon 1 1 led to defective G2/M cell 
cycle checkpoint function and genetic instability in 
mouse embryonic fibroblasts [47]. The cells revealed 
multiple functional centrosomes and unequal chromo- 
some segregation and aneuploidy. Although the molecu- 
lar basis for these abnormalities is not known at this 
time, it raises the interesting possibility that such an 
aneuploidy-driven mechanism may be involved in 
tumorigenesis in individuals carrying germline muta- 
tions of BRCA1 gene. 



Conclusion 

Growing evidence from human tumor cytogenetic inves- 
tigations strongly suggest that aneuploidy is associated 
with the development of tumor phenotypes. Clinical 
findings of correlation between aneuploidy and tumori- 
genesis are supported by studies with in vitro grown 
transformed cell lines. Molecular genetic analyses of 
tumor cells provide credible evidence that mutations in 
genes controlling chromosome segregation during 
mitosis play a critical role in causing chromosome insta- 
bility leading to aneuploidy in cancer. Further elucida- 
tion of molecular and physiologic bases of chromosome 
instability and aneuploidy induction could lead to the 
development of new therapeutic approaches for 
common forms of cancer. 
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